| Distinct | 756 |
|---|
| Distinct (%) | 71.9% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 8.3 KiB |
|---|
Length
| Max length | 16 |
|---|
| Median length | 15 |
|---|
| Mean length | 14.8973384 |
|---|
| Min length | 13 |
|---|
Characters and Unicode
| Total characters | 15672 |
|---|
| Distinct characters | 13 |
|---|
| Distinct categories | 3 ? |
|---|
| Distinct scripts | 1 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Unique | 574 ? |
|---|
| Unique (%) | 54.6% |
|---|
Sample
| 1st row | 1/13/2021 14:25 |
|---|
| 2nd row | 1/13/2021 14:37 |
|---|
| 3rd row | 1/17/2021 19:04 |
|---|
| 4th row | 1/17/2021 19:09 |
|---|
| 5th row | 1/18/2021 12:08 |
|---|
| Value | Count | Frequency (%) |
| 1/20/2021 | 383 | 18.2% |
| 1/21/2021 | 235 | 11.2% |
| 2/19/2021 | 94 | 4.5% |
| 2/21/2021 | 79 | 3.8% |
| 1/23/2021 | 36 | 1.7% |
| 1/22/2021 | 29 | 1.4% |
| 1/7/2022 | 29 | 1.4% |
| 1/26/2021 | 22 | 1.0% |
| 1/19/2021 | 22 | 1.0% |
| 4/23/2021 | 13 | 0.6% |
| Other values (583) | 1162 | 55.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 3847 | 24.5% |
| 1 | 3645 | 23.3% |
| / | 2104 | 13.4% |
| 0 | 1806 | 11.5% |
| 1052 | 6.7% |
| : | 1052 | 6.7% |
| 4 | 492 | 3.1% |
| 3 | 447 | 2.9% |
| 5 | 414 | 2.6% |
| 9 | 258 | 1.6% |
| Other values (3) | 555 | 3.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 11464 | 73.1% |
| Other Punctuation | 3156 | 20.1% |
| Space Separator | 1052 | 6.7% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 3847 | 33.6% |
| 1 | 3645 | 31.8% |
| 0 | 1806 | 15.8% |
| 4 | 492 | 4.3% |
| 3 | 447 | 3.9% |
| 5 | 414 | 3.6% |
| 9 | 258 | 2.3% |
| 7 | 196 | 1.7% |
| 6 | 188 | 1.6% |
| 8 | 171 | 1.5% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 2104 | 66.7% |
| : | 1052 | 33.3% |
Space Separator
| Value | Count | Frequency (%) |
| 1052 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 15672 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 3847 | 24.5% |
| 1 | 3645 | 23.3% |
| / | 2104 | 13.4% |
| 0 | 1806 | 11.5% |
| 1052 | 6.7% |
| : | 1052 | 6.7% |
| 4 | 492 | 3.1% |
| 3 | 447 | 2.9% |
| 5 | 414 | 2.6% |
| 9 | 258 | 1.6% |
| Other values (3) | 555 | 3.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 15672 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 3847 | 24.5% |
| 1 | 3645 | 23.3% |
| / | 2104 | 13.4% |
| 0 | 1806 | 11.5% |
| 1052 | 6.7% |
| : | 1052 | 6.7% |
| 4 | 492 | 3.1% |
| 3 | 447 | 2.9% |
| 5 | 414 | 2.6% |
| 9 | 258 | 1.6% |
| Other values (3) | 555 | 3.5% |
| Distinct | 960 |
|---|
| Distinct (%) | 91.3% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 8.3 KiB |
|---|
Length
| Max length | 37 |
|---|
| Median length | 34 |
|---|
| Mean length | 23.13403042 |
|---|
| Min length | 14 |
|---|
Characters and Unicode
| Total characters | 24337 |
|---|
| Distinct characters | 55 |
|---|
| Distinct categories | 6 ? |
|---|
| Distinct scripts | 2 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Unique | 877 ? |
|---|
| Unique (%) | 83.4% |
|---|
Sample
| 1st row | test@gmail.com |
|---|
| 2nd row | liyanashuib@gmail.com |
|---|
| 3rd row | azirasuhot@gmail.com |
|---|
| 4th row | haslina_m@um.edu.my |
|---|
| 5th row | noorain277@um.edu.my |
|---|
| Value | Count | Frequency (%) |
| wie180036@siswa.um.edu.my | 4 | 0.4% |
| arinaariesyha6@gmail.com | 3 | 0.3% |
| dpm19086008@mahsastudent.edu.my | 3 | 0.3% |
| janeesamin@gmail.com | 3 | 0.3% |
| eizzahamin99@gmail.com | 3 | 0.3% |
| piaabalqis@gmail.com | 3 | 0.3% |
| meowfaa@gmail.con | 3 | 0.3% |
| aaasnathaniel@gmail.com | 3 | 0.3% |
| nrl.ezrina08@gmail.com | 2 | 0.2% |
| ikhmalzzz20@gmail.com | 2 | 0.2% |
| Other values (950) | 1023 | 97.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 2838 | 11.7% |
| m | 2491 | 10.2% |
| i | 2123 | 8.7% |
| . | 1645 | 6.8% |
| l | 1125 | 4.6% |
| @ | 1052 | 4.3% |
| o | 1002 | 4.1% |
| u | 984 | 4.0% |
| s | 929 | 3.8% |
| c | 851 | 3.5% |
| Other values (45) | 9297 | 38.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 18609 | 76.5% |
| Decimal Number | 2977 | 12.2% |
| Other Punctuation | 2697 | 11.1% |
| Uppercase Letter | 40 | 0.2% |
| Connector Punctuation | 13 | 0.1% |
| Dash Punctuation | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 2838 | 15.3% |
| m | 2491 | 13.4% |
| i | 2123 | 11.4% |
| l | 1125 | 6.0% |
| o | 1002 | 5.4% |
| u | 984 | 5.3% |
| s | 929 | 5.0% |
| c | 851 | 4.6% |
| g | 829 | 4.5% |
| n | 816 | 4.4% |
| Other values (16) | 4621 | 24.8% |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 8 | 20.0% |
| P | 8 | 20.0% |
| U | 6 | 15.0% |
| A | 4 | 10.0% |
| I | 2 | 5.0% |
| S | 2 | 5.0% |
| F | 2 | 5.0% |
| D | 1 | 2.5% |
| M | 1 | 2.5% |
| J | 1 | 2.5% |
| Other values (5) | 5 | 12.5% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 802 | 26.9% |
| 2 | 453 | 15.2% |
| 1 | 403 | 13.5% |
| 9 | 244 | 8.2% |
| 3 | 211 | 7.1% |
| 5 | 197 | 6.6% |
| 8 | 190 | 6.4% |
| 7 | 169 | 5.7% |
| 4 | 157 | 5.3% |
| 6 | 151 | 5.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1645 | 61.0% |
| @ | 1052 | 39.0% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 13 | 100.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 18649 | 76.6% |
| Common | 5688 | 23.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 2838 | 15.2% |
| m | 2491 | 13.4% |
| i | 2123 | 11.4% |
| l | 1125 | 6.0% |
| o | 1002 | 5.4% |
| u | 984 | 5.3% |
| s | 929 | 5.0% |
| c | 851 | 4.6% |
| g | 829 | 4.4% |
| n | 816 | 4.4% |
| Other values (31) | 4661 | 25.0% |
Common
| Value | Count | Frequency (%) |
| . | 1645 | 28.9% |
| @ | 1052 | 18.5% |
| 0 | 802 | 14.1% |
| 2 | 453 | 8.0% |
| 1 | 403 | 7.1% |
| 9 | 244 | 4.3% |
| 3 | 211 | 3.7% |
| 5 | 197 | 3.5% |
| 8 | 190 | 3.3% |
| 7 | 169 | 3.0% |
| Other values (4) | 322 | 5.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 24337 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 2838 | 11.7% |
| m | 2491 | 10.2% |
| i | 2123 | 8.7% |
| . | 1645 | 6.8% |
| l | 1125 | 4.6% |
| @ | 1052 | 4.3% |
| o | 1002 | 4.1% |
| u | 984 | 4.0% |
| s | 929 | 3.8% |
| c | 851 | 3.5% |
| Other values (45) | 9297 | 38.2% |
| Distinct | 1 |
|---|
| Distinct (%) | 0.1% |
|---|
| Missing | 16 |
|---|
| Missing (%) | 1.5% |
|---|
| Memory size | 8.3 KiB |
|---|
Length
| Max length | 5 |
|---|
| Median length | 5 |
|---|
| Mean length | 5 |
|---|
| Min length | 5 |
|---|
Characters and Unicode
| Total characters | 5180 |
|---|
| Distinct characters | 4 |
|---|
| Distinct categories | 2 ? |
|---|
| Distinct scripts | 1 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Sample
| 1st row | Agree |
|---|
| 2nd row | Agree |
|---|
| 3rd row | Agree |
|---|
| 4th row | Agree |
|---|
| 5th row | Agree |
|---|
| Value | Count | Frequency (%) |
| agree | 1036 | 100.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 2072 | 40.0% |
| A | 1036 | 20.0% |
| g | 1036 | 20.0% |
| r | 1036 | 20.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4144 | 80.0% |
| Uppercase Letter | 1036 | 20.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 2072 | 50.0% |
| g | 1036 | 25.0% |
| r | 1036 | 25.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 1036 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 5180 | 100.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 2072 | 40.0% |
| A | 1036 | 20.0% |
| g | 1036 | 20.0% |
| r | 1036 | 20.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5180 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 2072 | 40.0% |
| A | 1036 | 20.0% |
| g | 1036 | 20.0% |
| r | 1036 | 20.0% |
| Distinct | 2 |
|---|
| Distinct (%) | 0.2% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 8.3 KiB |
|---|
Length
| Max length | 6 |
|---|
| Median length | 6 |
|---|
| Mean length | 5.330798479 |
|---|
| Min length | 4 |
|---|
Characters and Unicode
| Total characters | 5608 |
|---|
| Distinct characters | 6 |
|---|
| Distinct categories | 2 ? |
|---|
| Distinct scripts | 1 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Sample
| 1st row | Female |
|---|
| 2nd row | Female |
|---|
| 3rd row | Female |
|---|
| 4th row | Female |
|---|
| 5th row | Female |
|---|
| Value | Count | Frequency (%) |
| female | 700 | 66.5% |
| male | 352 | 33.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1752 | 31.2% |
| a | 1052 | 18.8% |
| l | 1052 | 18.8% |
| F | 700 | 12.5% |
| m | 700 | 12.5% |
| M | 352 | 6.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4556 | 81.2% |
| Uppercase Letter | 1052 | 18.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1752 | 38.5% |
| a | 1052 | 23.1% |
| l | 1052 | 23.1% |
| m | 700 | 15.4% |
Uppercase Letter
| Value | Count | Frequency (%) |
| F | 700 | 66.5% |
| M | 352 | 33.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 5608 | 100.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1752 | 31.2% |
| a | 1052 | 18.8% |
| l | 1052 | 18.8% |
| F | 700 | 12.5% |
| m | 700 | 12.5% |
| M | 352 | 6.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5608 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 1752 | 31.2% |
| a | 1052 | 18.8% |
| l | 1052 | 18.8% |
| F | 700 | 12.5% |
| m | 700 | 12.5% |
| M | 352 | 6.3% |
| Distinct | 5 |
|---|
| Distinct (%) | 0.5% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 8.3 KiB |
|---|
Length
| Max length | 19 |
|---|
| Median length | 13 |
|---|
| Mean length | 13.67110266 |
|---|
| Min length | 3 |
|---|
Characters and Unicode
| Total characters | 14382 |
|---|
| Distinct characters | 23 |
|---|
| Distinct categories | 3 ? |
|---|
| Distinct scripts | 2 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Sample
| 1st row | Undergraduate |
|---|
| 2nd row | Postgraduate |
|---|
| 3rd row | Postgraduate |
|---|
| 4th row | Postgraduate |
|---|
| 5th row | Postgraduate |
|---|
| Value | Count | Frequency (%) |
| undergraduate | 901 | 85.6% |
| certificate/diploma | 134 | 12.7% |
| master | 9 | 0.9% |
| postgraduate | 5 | 0.5% |
| phd | 3 | 0.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 2089 | 14.5% |
| e | 2084 | 14.5% |
| r | 1950 | 13.6% |
| d | 1807 | 12.6% |
| t | 1188 | 8.3% |
| g | 906 | 6.3% |
| u | 906 | 6.3% |
| U | 901 | 6.3% |
| n | 901 | 6.3% |
| i | 402 | 2.8% |
| Other values (13) | 1248 | 8.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 13059 | 90.8% |
| Uppercase Letter | 1189 | 8.3% |
| Other Punctuation | 134 | 0.9% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 2089 | 16.0% |
| e | 2084 | 16.0% |
| r | 1950 | 14.9% |
| d | 1807 | 13.8% |
| t | 1188 | 9.1% |
| g | 906 | 6.9% |
| u | 906 | 6.9% |
| n | 901 | 6.9% |
| i | 402 | 3.1% |
| o | 139 | 1.1% |
| Other values (7) | 687 | 5.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 901 | 75.8% |
| D | 137 | 11.5% |
| C | 134 | 11.3% |
| M | 9 | 0.8% |
| P | 8 | 0.7% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 134 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 14248 | 99.1% |
| Common | 134 | 0.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 2089 | 14.7% |
| e | 2084 | 14.6% |
| r | 1950 | 13.7% |
| d | 1807 | 12.7% |
| t | 1188 | 8.3% |
| g | 906 | 6.4% |
| u | 906 | 6.4% |
| U | 901 | 6.3% |
| n | 901 | 6.3% |
| i | 402 | 2.8% |
| Other values (12) | 1114 | 7.8% |
Common
| Value | Count | Frequency (%) |
| / | 134 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 14382 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 2089 | 14.5% |
| e | 2084 | 14.5% |
| r | 1950 | 13.6% |
| d | 1807 | 12.6% |
| t | 1188 | 8.3% |
| g | 906 | 6.3% |
| u | 906 | 6.3% |
| U | 901 | 6.3% |
| n | 901 | 6.3% |
| i | 402 | 2.8% |
| Other values (13) | 1248 | 8.7% |
| Distinct | 168 |
|---|
| Distinct (%) | 16.0% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 8.3 KiB |
|---|
Length
| Max length | 47 |
|---|
| Median length | 35 |
|---|
| Mean length | 19.27376426 |
|---|
| Min length | 2 |
|---|
Characters and Unicode
| Total characters | 20276 |
|---|
| Distinct characters | 59 |
|---|
| Distinct categories | 7 ? |
|---|
| Distinct scripts | 2 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Unique | 108 ? |
|---|
| Unique (%) | 10.3% |
|---|
Sample
| 1st row | Veterinary |
|---|
| 2nd row | Computing |
|---|
| 3rd row | Computing |
|---|
| 4th row | 3:00 |
|---|
| 5th row | Humanities |
|---|
| Value | Count | Frequency (%) |
| technology | 219 | 10.4% |
| computer | 215 | 10.2% |
| science/information | 214 | 10.1% |
| and | 208 | 9.9% |
| architecture | 104 | 4.9% |
| building | 103 | 4.9% |
| engineering | 93 | 4.4% |
| economy | 69 | 3.3% |
| education | 62 | 2.9% |
| science | 61 | 2.9% |
| Other values (138) | 762 | 36.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 2111 | 10.4% |
| i | 1747 | 8.6% |
| e | 1730 | 8.5% |
| o | 1584 | 7.8% |
| c | 1379 | 6.8% |
| t | 1138 | 5.6% |
| 1095 | 5.4% |
| r | 1028 | 5.1% |
| a | 1009 | 5.0% |
| u | 757 | 3.7% |
| Other values (49) | 6698 | 33.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 16709 | 82.4% |
| Uppercase Letter | 2241 | 11.1% |
| Space Separator | 1095 | 5.4% |
| Other Punctuation | 226 | 1.1% |
| Decimal Number | 3 | < 0.1% |
| Open Punctuation | 1 | < 0.1% |
| Close Punctuation | 1 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 407 | 18.2% |
| I | 274 | 12.2% |
| E | 264 | 11.8% |
| T | 262 | 11.7% |
| C | 247 | 11.0% |
| A | 238 | 10.6% |
| B | 174 | 7.8% |
| L | 82 | 3.7% |
| M | 55 | 2.5% |
| P | 41 | 1.8% |
| Other values (15) | 197 | 8.8% |
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 2111 | 12.6% |
| i | 1747 | 10.5% |
| e | 1730 | 10.4% |
| o | 1584 | 9.5% |
| c | 1379 | 8.3% |
| t | 1138 | 6.8% |
| r | 1028 | 6.2% |
| a | 1009 | 6.0% |
| u | 757 | 4.5% |
| g | 677 | 4.1% |
| Other values (14) | 3549 | 21.2% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 214 | 94.7% |
| & | 7 | 3.1% |
| , | 3 | 1.3% |
| ' | 1 | 0.4% |
| : | 1 | 0.4% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 2 | 66.7% |
| 3 | 1 | 33.3% |
Space Separator
| Value | Count | Frequency (%) |
| 1095 | 100.0% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1 | 100.0% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 18950 | 93.5% |
| Common | 1326 | 6.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 2111 | 11.1% |
| i | 1747 | 9.2% |
| e | 1730 | 9.1% |
| o | 1584 | 8.4% |
| c | 1379 | 7.3% |
| t | 1138 | 6.0% |
| r | 1028 | 5.4% |
| a | 1009 | 5.3% |
| u | 757 | 4.0% |
| g | 677 | 3.6% |
| Other values (39) | 5790 | 30.6% |
Common
| Value | Count | Frequency (%) |
| 1095 | 82.6% |
| / | 214 | 16.1% |
| & | 7 | 0.5% |
| , | 3 | 0.2% |
| 0 | 2 | 0.2% |
| ( | 1 | 0.1% |
| ' | 1 | 0.1% |
| 3 | 1 | 0.1% |
| : | 1 | 0.1% |
| ) | 1 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 20276 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| n | 2111 | 10.4% |
| i | 1747 | 8.6% |
| e | 1730 | 8.5% |
| o | 1584 | 7.8% |
| c | 1379 | 6.8% |
| t | 1138 | 5.6% |
| 1095 | 5.4% |
| r | 1028 | 5.1% |
| a | 1009 | 5.0% |
| u | 757 | 3.7% |
| Other values (49) | 6698 | 33.0% |
| Distinct | 238 |
|---|
| Distinct (%) | 22.7% |
|---|
| Missing | 2 |
|---|
| Missing (%) | 0.2% |
|---|
| Memory size | 8.3 KiB |
|---|
Length
| Max length | 53 |
|---|
| Median length | 49 |
|---|
| Mean length | 16.71809524 |
|---|
| Min length | 1 |
|---|
Characters and Unicode
| Total characters | 17554 |
|---|
| Distinct characters | 57 |
|---|
| Distinct categories | 8 ? |
|---|
| Distinct scripts | 2 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Unique | 161 ? |
|---|
| Unique (%) | 15.3% |
|---|
Sample
| 1st row | test |
|---|
| 2nd row | UM |
|---|
| 3rd row | UM |
|---|
| 4th row | University Malaya |
|---|
| 5th row | UM |
|---|
| Value | Count | Frequency (%) |
| malaya | 560 | 21.7% |
| university | 453 | 17.6% |
| of | 303 | 11.7% |
| universiti | 181 | 7.0% |
| uitm | 133 | 5.2% |
| um | 88 | 3.4% |
| mara | 73 | 2.8% |
| pasir | 69 | 2.7% |
| mas | 67 | 2.6% |
| kolej | 62 | 2.4% |
| Other values (145) | 590 | 22.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 2124 | 12.1% |
| i | 1775 | 10.1% |
| 1633 | 9.3% |
| y | 981 | 5.6% |
| M | 939 | 5.3% |
| U | 884 | 5.0% |
| n | 858 | 4.9% |
| e | 814 | 4.6% |
| r | 781 | 4.4% |
| s | 777 | 4.4% |
| Other values (47) | 5988 | 34.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 11779 | 67.1% |
| Uppercase Letter | 4101 | 23.4% |
| Space Separator | 1633 | 9.3% |
| Other Punctuation | 13 | 0.1% |
| Open Punctuation | 11 | 0.1% |
| Close Punctuation | 11 | 0.1% |
| Decimal Number | 4 | < 0.1% |
| Dash Punctuation | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 2124 | 18.0% |
| i | 1775 | 15.1% |
| y | 981 | 8.3% |
| n | 858 | 7.3% |
| e | 814 | 6.9% |
| r | 781 | 6.6% |
| s | 777 | 6.6% |
| t | 744 | 6.3% |
| l | 694 | 5.9% |
| v | 588 | 5.0% |
| Other values (14) | 1643 | 13.9% |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 939 | 22.9% |
| U | 884 | 21.6% |
| A | 403 | 9.8% |
| I | 356 | 8.7% |
| T | 261 | 6.4% |
| K | 185 | 4.5% |
| S | 176 | 4.3% |
| R | 135 | 3.3% |
| N | 123 | 3.0% |
| E | 104 | 2.5% |
| Other values (14) | 535 | 13.0% |
Decimal Number
| Value | Count | Frequency (%) |
| 5 | 1 | 25.0% |
| 0 | 1 | 25.0% |
| 9 | 1 | 25.0% |
| 1 | 1 | 25.0% |
Space Separator
| Value | Count | Frequency (%) |
| 1633 | 100.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 13 | 100.0% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 11 | 100.0% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 11 | 100.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 15880 | 90.5% |
| Common | 1674 | 9.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 2124 | 13.4% |
| i | 1775 | 11.2% |
| y | 981 | 6.2% |
| M | 939 | 5.9% |
| U | 884 | 5.6% |
| n | 858 | 5.4% |
| e | 814 | 5.1% |
| r | 781 | 4.9% |
| s | 777 | 4.9% |
| t | 744 | 4.7% |
| Other values (38) | 5203 | 32.8% |
Common
| Value | Count | Frequency (%) |
| 1633 | 97.6% |
| , | 13 | 0.8% |
| ( | 11 | 0.7% |
| ) | 11 | 0.7% |
| - | 2 | 0.1% |
| 5 | 1 | 0.1% |
| 0 | 1 | 0.1% |
| 9 | 1 | 0.1% |
| 1 | 1 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 17554 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 2124 | 12.1% |
| i | 1775 | 10.1% |
| 1633 | 9.3% |
| y | 981 | 5.6% |
| M | 939 | 5.3% |
| U | 884 | 5.0% |
| n | 858 | 4.9% |
| e | 814 | 4.6% |
| r | 781 | 4.4% |
| s | 777 | 4.4% |
| Other values (47) | 5988 | 34.1% |
| Distinct | 77 |
|---|
| Distinct (%) | 7.3% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 8.3 KiB |
|---|
Length
| Max length | 27 |
|---|
| Median length | 8 |
|---|
| Mean length | 8.189163498 |
|---|
| Min length | 2 |
|---|
Characters and Unicode
| Total characters | 8615 |
|---|
| Distinct characters | 45 |
|---|
| Distinct categories | 5 ? |
|---|
| Distinct scripts | 2 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Sample
| 1st row | test |
|---|
| 2nd row | Malaysia |
|---|
| 3rd row | Malaysia |
|---|
| 4th row | Malaysia |
|---|
| 5th row | Malaysia |
|---|
| Value | Count | Frequency (%) |
| malaysia | 915 | 84.0% |
| kelantan | 30 | 2.8% |
| selangor | 16 | 1.5% |
| kedah | 11 | 1.0% |
| kuala | 11 | 1.0% |
| terengganu | 10 | 0.9% |
| johor | 9 | 0.8% |
| mas | 9 | 0.8% |
| pasir | 9 | 0.8% |
| lumpur | 6 | 0.6% |
| Other values (40) | 63 | 5.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 2714 | 31.5% |
| l | 910 | 10.6% |
| M | 901 | 10.5% |
| s | 875 | 10.2% |
| i | 867 | 10.1% |
| y | 851 | 9.9% |
| A | 248 | 2.9% |
| 185 | 2.1% |
| n | 109 | 1.3% |
| S | 95 | 1.1% |
| Other values (35) | 860 | 10.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 6757 | 78.4% |
| Uppercase Letter | 1664 | 19.3% |
| Space Separator | 185 | 2.1% |
| Other Punctuation | 8 | 0.1% |
| Dash Punctuation | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 2714 | 40.2% |
| l | 910 | 13.5% |
| s | 875 | 12.9% |
| i | 867 | 12.8% |
| y | 851 | 12.6% |
| n | 109 | 1.6% |
| e | 80 | 1.2% |
| r | 53 | 0.8% |
| g | 47 | 0.7% |
| m | 42 | 0.6% |
| Other values (11) | 209 | 3.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 901 | 54.1% |
| A | 248 | 14.9% |
| S | 95 | 5.7% |
| L | 84 | 5.0% |
| I | 80 | 4.8% |
| Y | 72 | 4.3% |
| K | 50 | 3.0% |
| N | 24 | 1.4% |
| P | 21 | 1.3% |
| E | 20 | 1.2% |
| Other values (10) | 69 | 4.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 7 | 87.5% |
| ' | 1 | 12.5% |
Space Separator
| Value | Count | Frequency (%) |
| 185 | 100.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8421 | 97.7% |
| Common | 194 | 2.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 2714 | 32.2% |
| l | 910 | 10.8% |
| M | 901 | 10.7% |
| s | 875 | 10.4% |
| i | 867 | 10.3% |
| y | 851 | 10.1% |
| A | 248 | 2.9% |
| n | 109 | 1.3% |
| S | 95 | 1.1% |
| L | 84 | 1.0% |
| Other values (31) | 767 | 9.1% |
Common
| Value | Count | Frequency (%) |
| 185 | 95.4% |
| , | 7 | 3.6% |
| ' | 1 | 0.5% |
| - | 1 | 0.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8615 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 2714 | 31.5% |
| l | 910 | 10.6% |
| M | 901 | 10.5% |
| s | 875 | 10.2% |
| i | 867 | 10.1% |
| y | 851 | 9.9% |
| A | 248 | 2.9% |
| 185 | 2.1% |
| n | 109 | 1.3% |
| S | 95 | 1.1% |
| Other values (35) | 860 | 10.0% |
| Distinct | 6 |
|---|
| Distinct (%) | 0.6% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 8.3 KiB |
|---|
Length
| Max length | 21 |
|---|
| Median length | 18 |
|---|
| Mean length | 18.82034221 |
|---|
| Min length | 16 |
|---|
Characters and Unicode
| Total characters | 19799 |
|---|
| Distinct characters | 26 |
|---|
| Distinct categories | 8 ? |
|---|
| Distinct scripts | 2 ? |
|---|
| Distinct blocks | 4 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Sample
| 1st row | RM 3001 - 10 000 |
|---|
| 2nd row | RM 3001 - 10 000 |
|---|
| 3rd row | RM 3001 - 10 000 |
|---|
| 4th row | RM 10 001 - 25 000 |
|---|
| 5th row | RM 10 001 - 25 000 |
|---|
| Value | Count | Frequency (%) |
| rm | 948 | 23.0% |
| than | 737 | 17.9% |
| less | 633 | 15.4% |
| 4,849 | 620 | 15.0% |
| 4,850 | 300 | 7.3% |
| – | 300 | 7.3% |
| rm10,959 | 300 | 7.3% |
| more | 104 | 2.5% |
| rm10,960 | 104 | 2.5% |
| 15 | 0.4% |
| Other values (6) | 61 | 1.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| 3070 | 15.5% |
| 4 | 1540 | 7.8% |
| M | 1456 | 7.4% |
| R | 1352 | 6.8% |
| 9 | 1324 | 6.7% |
| , | 1324 | 6.7% |
| s | 1266 | 6.4% |
| 0 | 937 | 4.7% |
| 8 | 920 | 4.6% |
| t | 737 | 3.7% |
| Other values (16) | 5873 | 29.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 5890 | 29.7% |
| Lowercase Letter | 5459 | 27.6% |
| Uppercase Letter | 3441 | 17.4% |
| Space Separator | 3070 | 15.5% |
| Other Punctuation | 1324 | 6.7% |
| Currency Symbol | 300 | 1.5% |
| Initial Punctuation | 300 | 1.5% |
| Dash Punctuation | 15 | 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 1540 | 26.1% |
| 9 | 1324 | 22.5% |
| 0 | 937 | 15.9% |
| 8 | 920 | 15.6% |
| 5 | 603 | 10.2% |
| 1 | 434 | 7.4% |
| 6 | 104 | 1.8% |
| 3 | 25 | 0.4% |
| 2 | 3 | 0.1% |
Lowercase Letter
| Value | Count | Frequency (%) |
| s | 1266 | 23.2% |
| t | 737 | 13.5% |
| h | 737 | 13.5% |
| a | 737 | 13.5% |
| n | 737 | 13.5% |
| e | 737 | 13.5% |
| â | 300 | 5.5% |
| o | 104 | 1.9% |
| r | 104 | 1.9% |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 1456 | 42.3% |
| R | 1352 | 39.3% |
| L | 633 | 18.4% |
Space Separator
| Value | Count | Frequency (%) |
| 3070 | 100.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 1324 | 100.0% |
Currency Symbol
| Value | Count | Frequency (%) |
| € | 300 | 100.0% |
Initial Punctuation
| Value | Count | Frequency (%) |
| “ | 300 | 100.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 15 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 10899 | 55.0% |
| Latin | 8900 | 45.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 3070 | 28.2% |
| 4 | 1540 | 14.1% |
| 9 | 1324 | 12.1% |
| , | 1324 | 12.1% |
| 0 | 937 | 8.6% |
| 8 | 920 | 8.4% |
| 5 | 603 | 5.5% |
| 1 | 434 | 4.0% |
| € | 300 | 2.8% |
| “ | 300 | 2.8% |
| Other values (4) | 147 | 1.3% |
Latin
| Value | Count | Frequency (%) |
| M | 1456 | 16.4% |
| R | 1352 | 15.2% |
| s | 1266 | 14.2% |
| t | 737 | 8.3% |
| h | 737 | 8.3% |
| a | 737 | 8.3% |
| n | 737 | 8.3% |
| e | 737 | 8.3% |
| L | 633 | 7.1% |
| â | 300 | 3.4% |
| Other values (2) | 208 | 2.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 18899 | 95.5% |
| None | 300 | 1.5% |
| Currency Symbols | 300 | 1.5% |
| Punctuation | 300 | 1.5% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 3070 | 16.2% |
| 4 | 1540 | 8.1% |
| M | 1456 | 7.7% |
| R | 1352 | 7.2% |
| 9 | 1324 | 7.0% |
| , | 1324 | 7.0% |
| s | 1266 | 6.7% |
| 0 | 937 | 5.0% |
| 8 | 920 | 4.9% |
| t | 737 | 3.9% |
| Other values (13) | 4973 | 26.3% |
None
| Value | Count | Frequency (%) |
| â | 300 | 100.0% |
Currency Symbols
| Value | Count | Frequency (%) |
| € | 300 | 100.0% |
Punctuation
| Value | Count | Frequency (%) |
| “ | 300 | 100.0% |
| Distinct | 7 |
|---|
| Distinct (%) | 0.7% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 8.3 KiB |
|---|
Length
| Max length | 102 |
|---|
| Median length | 88 |
|---|
| Mean length | 40.35931559 |
|---|
| Min length | 12 |
|---|
Characters and Unicode
| Total characters | 42458 |
|---|
| Distinct characters | 27 |
|---|
| Distinct categories | 6 ? |
|---|
| Distinct scripts | 2 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Sample
| 1st row | Face to Face |
|---|
| 2nd row | Face to Face, Synchronous Online Learning (Real Time) |
|---|
| 3rd row | Face to Face |
|---|
| 4th row | Face to Face, Asynchronous Online Learning (On your own time) |
|---|
| 5th row | Face to Face |
|---|
| Value | Count | Frequency (%) |
| face | 1620 | 23.7% |
| to | 810 | 11.8% |
| online | 742 | 10.8% |
| learning | 742 | 10.8% |
| time | 742 | 10.8% |
| synchronous | 392 | 5.7% |
| real | 392 | 5.7% |
| asynchronous | 350 | 5.1% |
| on | 350 | 5.1% |
| your | 350 | 5.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 5788 | 13.6% |
| n | 5152 | 12.1% |
| e | 4238 | 10.0% |
| o | 2994 | 7.1% |
| a | 2754 | 6.5% |
| c | 2362 | 5.6% |
| i | 2226 | 5.2% |
| r | 1834 | 4.3% |
| F | 1620 | 3.8% |
| t | 1160 | 2.7% |
| Other values (17) | 12330 | 29.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 29706 | 70.0% |
| Space Separator | 5788 | 13.6% |
| Uppercase Letter | 4980 | 11.7% |
| Open Punctuation | 742 | 1.7% |
| Close Punctuation | 742 | 1.7% |
| Other Punctuation | 500 | 1.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 5152 | 17.3% |
| e | 4238 | 14.3% |
| o | 2994 | 10.1% |
| a | 2754 | 9.3% |
| c | 2362 | 8.0% |
| i | 2226 | 7.5% |
| r | 1834 | 6.2% |
| t | 1160 | 3.9% |
| l | 1134 | 3.8% |
| s | 1092 | 3.7% |
| Other values (6) | 4760 | 16.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| F | 1620 | 32.5% |
| O | 1092 | 21.9% |
| L | 742 | 14.9% |
| S | 392 | 7.9% |
| R | 392 | 7.9% |
| T | 392 | 7.9% |
| A | 350 | 7.0% |
Space Separator
| Value | Count | Frequency (%) |
| 5788 | 100.0% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 742 | 100.0% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 742 | 100.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 500 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 34686 | 81.7% |
| Common | 7772 | 18.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 5152 | 14.9% |
| e | 4238 | 12.2% |
| o | 2994 | 8.6% |
| a | 2754 | 7.9% |
| c | 2362 | 6.8% |
| i | 2226 | 6.4% |
| r | 1834 | 5.3% |
| F | 1620 | 4.7% |
| t | 1160 | 3.3% |
| l | 1134 | 3.3% |
| Other values (13) | 9212 | 26.6% |
Common
| Value | Count | Frequency (%) |
| 5788 | 74.5% |
| ( | 742 | 9.5% |
| ) | 742 | 9.5% |
| , | 500 | 6.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 42458 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 5788 | 13.6% |
| n | 5152 | 12.1% |
| e | 4238 | 10.0% |
| o | 2994 | 7.1% |
| a | 2754 | 6.5% |
| c | 2362 | 5.6% |
| i | 2226 | 5.2% |
| r | 1834 | 4.3% |
| F | 1620 | 3.8% |
| t | 1160 | 2.7% |
| Other values (17) | 12330 | 29.0% |
| Distinct | 91 |
|---|
| Distinct (%) | 8.7% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 8.3 KiB |
|---|
Length
| Max length | 62 |
|---|
| Median length | 47 |
|---|
| Mean length | 19.21768061 |
|---|
| Min length | 4 |
|---|
Characters and Unicode
| Total characters | 20217 |
|---|
| Distinct characters | 45 |
|---|
| Distinct categories | 4 ? |
|---|
| Distinct scripts | 2 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Sample
| 1st row | Twitter |
|---|
| 2nd row | Instagram |
|---|
| 3rd row | Facebook, Twitter, Instagram |
|---|
| 4th row | Facebook, Youtube |
|---|
| 5th row | Facebook |
|---|
| Value | Count | Frequency (%) |
| youtube | 753 | 33.7% |
| instagram | 591 | 26.4% |
| facebook | 421 | 18.8% |
| twitter | 270 | 12.1% |
| blogger/wordpress | 72 | 3.2% |
| google | 22 | 1.0% |
| meet | 17 | 0.8% |
| tiktok | 13 | 0.6% |
| teams | 13 | 0.6% |
| microsoft | 10 | 0.4% |
| Other values (25) | 55 | 2.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 1950 | 9.6% |
| o | 1840 | 9.1% |
| e | 1686 | 8.3% |
| a | 1663 | 8.2% |
| u | 1509 | 7.5% |
| 1197 | 5.9% |
| b | 1175 | 5.8% |
| , | 1145 | 5.7% |
| r | 1105 | 5.5% |
| s | 784 | 3.9% |
| Other values (35) | 6163 | 30.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 15494 | 76.6% |
| Uppercase Letter | 2307 | 11.4% |
| Other Punctuation | 1219 | 6.0% |
| Space Separator | 1197 | 5.9% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 1950 | 12.6% |
| o | 1840 | 11.9% |
| e | 1686 | 10.9% |
| a | 1663 | 10.7% |
| u | 1509 | 9.7% |
| b | 1175 | 7.6% |
| r | 1105 | 7.1% |
| s | 784 | 5.1% |
| g | 768 | 5.0% |
| m | 638 | 4.1% |
| Other values (12) | 2376 | 15.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| Y | 753 | 32.6% |
| I | 594 | 25.7% |
| F | 423 | 18.3% |
| T | 304 | 13.2% |
| W | 85 | 3.7% |
| B | 72 | 3.1% |
| M | 23 | 1.0% |
| G | 21 | 0.9% |
| C | 6 | 0.3% |
| O | 6 | 0.3% |
| Other values (9) | 20 | 0.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 1145 | 93.9% |
| / | 72 | 5.9% |
| . | 2 | 0.2% |
Space Separator
| Value | Count | Frequency (%) |
| 1197 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 17801 | 88.0% |
| Common | 2416 | 12.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 1950 | 11.0% |
| o | 1840 | 10.3% |
| e | 1686 | 9.5% |
| a | 1663 | 9.3% |
| u | 1509 | 8.5% |
| b | 1175 | 6.6% |
| r | 1105 | 6.2% |
| s | 784 | 4.4% |
| g | 768 | 4.3% |
| Y | 753 | 4.2% |
| Other values (31) | 4568 | 25.7% |
Common
| Value | Count | Frequency (%) |
| 1197 | 49.5% |
| , | 1145 | 47.4% |
| / | 72 | 3.0% |
| . | 2 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 20217 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 1950 | 9.6% |
| o | 1840 | 9.1% |
| e | 1686 | 8.3% |
| a | 1663 | 8.2% |
| u | 1509 | 7.5% |
| 1197 | 5.9% |
| b | 1175 | 5.8% |
| , | 1145 | 5.7% |
| r | 1105 | 5.5% |
| s | 784 | 3.9% |
| Other values (35) | 6163 | 30.5% |
| Distinct | 52 |
|---|
| Distinct (%) | 4.9% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 8.3 KiB |
|---|
Length
| Max length | 80 |
|---|
| Median length | 63 |
|---|
| Mean length | 25.10646388 |
|---|
| Min length | 4 |
|---|
Characters and Unicode
| Total characters | 26412 |
|---|
| Distinct characters | 42 |
|---|
| Distinct categories | 4 ? |
|---|
| Distinct scripts | 2 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Sample
| 1st row | Whatsapp |
|---|
| 2nd row | Whatsapp |
|---|
| 3rd row | Email, University eLearning Chat Room, Whatsapp, Call, Telegram |
|---|
| 4th row | Email, Whatsapp |
|---|
| 5th row | Whatsapp |
|---|
| Value | Count | Frequency (%) |
| whatsapp | 940 | 28.5% |
| telegram | 554 | 16.8% |
| email | 499 | 15.1% |
| university | 272 | 8.2% |
| elearning | 272 | 8.2% |
| chat | 272 | 8.2% |
| room | 272 | 8.2% |
| call | 155 | 4.7% |
| google | 15 | 0.5% |
| meet | 13 | 0.4% |
| Other values (14) | 36 | 1.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 3646 | 13.8% |
| 2251 | 8.5% |
| e | 1979 | 7.5% |
| p | 1880 | 7.1% |
| t | 1506 | 5.7% |
| , | 1404 | 5.3% |
| l | 1383 | 5.2% |
| m | 1346 | 5.1% |
| i | 1327 | 5.0% |
| s | 1232 | 4.7% |
| Other values (32) | 8458 | 32.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 19454 | 73.7% |
| Uppercase Letter | 3303 | 12.5% |
| Space Separator | 2251 | 8.5% |
| Other Punctuation | 1404 | 5.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 3646 | 18.7% |
| e | 1979 | 10.2% |
| p | 1880 | 9.7% |
| t | 1506 | 7.7% |
| l | 1383 | 7.1% |
| m | 1346 | 6.9% |
| i | 1327 | 6.8% |
| s | 1232 | 6.3% |
| h | 1212 | 6.2% |
| r | 1111 | 5.7% |
| Other values (12) | 2832 | 14.6% |
Uppercase Letter
| Value | Count | Frequency (%) |
| W | 942 | 28.5% |
| T | 561 | 17.0% |
| E | 500 | 15.1% |
| C | 432 | 13.1% |
| R | 273 | 8.3% |
| L | 273 | 8.3% |
| U | 272 | 8.2% |
| M | 15 | 0.5% |
| G | 15 | 0.5% |
| Z | 5 | 0.2% |
| Other values (8) | 15 | 0.5% |
Space Separator
| Value | Count | Frequency (%) |
| 2251 | 100.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 1404 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 22757 | 86.2% |
| Common | 3655 | 13.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 3646 | 16.0% |
| e | 1979 | 8.7% |
| p | 1880 | 8.3% |
| t | 1506 | 6.6% |
| l | 1383 | 6.1% |
| m | 1346 | 5.9% |
| i | 1327 | 5.8% |
| s | 1232 | 5.4% |
| h | 1212 | 5.3% |
| r | 1111 | 4.9% |
| Other values (30) | 6135 | 27.0% |
Common
| Value | Count | Frequency (%) |
| 2251 | 61.6% |
| , | 1404 | 38.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 26412 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 3646 | 13.8% |
| 2251 | 8.5% |
| e | 1979 | 7.5% |
| p | 1880 | 7.1% |
| t | 1506 | 5.7% |
| , | 1404 | 5.3% |
| l | 1383 | 5.2% |
| m | 1346 | 5.1% |
| i | 1327 | 5.0% |
| s | 1232 | 4.7% |
| Other values (32) | 8458 | 32.0% |
| Distinct | 411 |
|---|
| Distinct (%) | 39.1% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 8.3 KiB |
|---|
Length
| Max length | 243 |
|---|
| Median length | 121 |
|---|
| Mean length | 65.56463878 |
|---|
| Min length | 2 |
|---|
Characters and Unicode
| Total characters | 68974 |
|---|
| Distinct characters | 51 |
|---|
| Distinct categories | 11 ? |
|---|
| Distinct scripts | 2 ? |
|---|
| Distinct blocks | 4 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Unique | 220 ? |
|---|
| Unique (%) | 20.9% |
|---|
Sample
| 1st row | Adaptability |
|---|
| 2nd row | Technical Issues, Quality of Material |
|---|
| 3rd row | Technical Issues, Engagement |
|---|
| 4th row | Adaptability, Technical Issues, Time Management, Self-Motivation, Quality of Material, Engagement |
|---|
| 5th row | Technical Issues |
|---|
| Value | Count | Frequency (%) |
| issues | 720 | 11.5% |
| technical | 680 | 10.9% |
| self-motivation | 629 | 10.1% |
| management | 464 | 7.4% |
| time | 463 | 7.4% |
| cost/focus/commitment | 407 | 6.5% |
| adaptability | 352 | 5.6% |
| accessibility | 348 | 5.6% |
| engagement | 345 | 5.5% |
| of | 308 | 4.9% |
| Other values (97) | 1525 | 24.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 6029 | 8.7% |
| e | 5975 | 8.7% |
| i | 5664 | 8.2% |
| 5191 | 7.5% |
| a | 4530 | 6.6% |
| s | 3986 | 5.8% |
| n | 3621 | 5.2% |
| o | 3566 | 5.2% |
| m | 3404 | 4.9% |
| , | 3060 | 4.4% |
| Other values (41) | 23948 | 34.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 51551 | 74.7% |
| Uppercase Letter | 7496 | 10.9% |
| Space Separator | 5191 | 7.5% |
| Other Punctuation | 4096 | 5.9% |
| Dash Punctuation | 630 | 0.9% |
| Currency Symbol | 2 | < 0.1% |
| Other Symbol | 2 | < 0.1% |
| Open Punctuation | 2 | < 0.1% |
| Close Punctuation | 2 | < 0.1% |
| Decimal Number | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 6029 | 11.7% |
| e | 5975 | 11.6% |
| i | 5664 | 11.0% |
| a | 4530 | 8.8% |
| s | 3986 | 7.7% |
| n | 3621 | 7.0% |
| o | 3566 | 6.9% |
| m | 3404 | 6.6% |
| c | 2938 | 5.7% |
| l | 2688 | 5.2% |
| Other values (15) | 9150 | 17.7% |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 1398 | 18.6% |
| C | 1328 | 17.7% |
| T | 1146 | 15.3% |
| I | 724 | 9.7% |
| A | 700 | 9.3% |
| S | 631 | 8.4% |
| F | 628 | 8.4% |
| E | 347 | 4.6% |
| Q | 306 | 4.1% |
| L | 237 | 3.2% |
| Other values (4) | 51 | 0.7% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 3060 | 74.7% |
| / | 1034 | 25.2% |
| ' | 1 | < 0.1% |
| : | 1 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 5191 | 100.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 630 | 100.0% |
Currency Symbol
| Value | Count | Frequency (%) |
| € | 2 | 100.0% |
Other Symbol
| Value | Count | Frequency (%) |
| â„¢ | 2 | 100.0% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 2 | 100.0% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 2 | 100.0% |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 1 | 100.0% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 1 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 59047 | 85.6% |
| Common | 9927 | 14.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 6029 | 10.2% |
| e | 5975 | 10.1% |
| i | 5664 | 9.6% |
| a | 4530 | 7.7% |
| s | 3986 | 6.8% |
| n | 3621 | 6.1% |
| o | 3566 | 6.0% |
| m | 3404 | 5.8% |
| c | 2938 | 5.0% |
| l | 2688 | 4.6% |
| Other values (29) | 16646 | 28.2% |
Common
| Value | Count | Frequency (%) |
| 5191 | 52.3% |
| , | 3060 | 30.8% |
| / | 1034 | 10.4% |
| - | 630 | 6.3% |
| € | 2 | < 0.1% |
| â„¢ | 2 | < 0.1% |
| ( | 2 | < 0.1% |
| ) | 2 | < 0.1% |
| ' | 1 | < 0.1% |
| 2 | 1 | < 0.1% |
| Other values (2) | 2 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 68968 | > 99.9% |
| None | 2 | < 0.1% |
| Currency Symbols | 2 | < 0.1% |
| Letterlike Symbols | 2 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 6029 | 8.7% |
| e | 5975 | 8.7% |
| i | 5664 | 8.2% |
| 5191 | 7.5% |
| a | 4530 | 6.6% |
| s | 3986 | 5.8% |
| n | 3621 | 5.3% |
| o | 3566 | 5.2% |
| m | 3404 | 4.9% |
| , | 3060 | 4.4% |
| Other values (38) | 23942 | 34.7% |
None
| Value | Count | Frequency (%) |
| â | 2 | 100.0% |
Currency Symbols
| Value | Count | Frequency (%) |
| € | 2 | 100.0% |
Letterlike Symbols
| Value | Count | Frequency (%) |
| â„¢ | 2 | 100.0% |
| Distinct | 5 |
|---|
| Distinct (%) | 0.5% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 8.3 KiB |
|---|
Length
| Max length | 10 |
|---|
| Median length | 9 |
|---|
| Mean length | 9.05608365 |
|---|
| Min length | 9 |
|---|
Characters and Unicode
| Total characters | 9527 |
|---|
| Distinct characters | 24 |
|---|
| Distinct categories | 4 ? |
|---|
| Distinct scripts | 2 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Sample
| 1st row | Somewhat |
|---|
| 2nd row | Somewhat |
|---|
| 3rd row | Very Much |
|---|
| 4th row | Very Much |
|---|
| 5th row | Very Much |
|---|
| Value | Count | Frequency (%) |
| very | 534 | 32.3% |
| much | 534 | 32.3% |
| somewhat | 400 | 24.2% |
| undecided | 59 | 3.6% |
| not | 59 | 3.6% |
| really | 52 | 3.1% |
| at | 7 | 0.4% |
| all | 7 | 0.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1104 | 11.6% |
| h | 934 | 9.8% |
| 600 | 6.3% |
| c | 593 | 6.2% |
| y | 586 | 6.2% |
| V | 534 | 5.6% |
| r | 534 | 5.6% |
| M | 534 | 5.6% |
| u | 534 | 5.6% |
| t | 466 | 4.9% |
| Other values (14) | 3108 | 32.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 6882 | 72.2% |
| Uppercase Letter | 1645 | 17.3% |
| Space Separator | 600 | 6.3% |
| Control | 400 | 4.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1104 | 16.0% |
| h | 934 | 13.6% |
| c | 593 | 8.6% |
| y | 586 | 8.5% |
| r | 534 | 7.8% |
| u | 534 | 7.8% |
| t | 466 | 6.8% |
| a | 459 | 6.7% |
| o | 459 | 6.7% |
| w | 400 | 5.8% |
| Other values (5) | 813 | 11.8% |
Uppercase Letter
| Value | Count | Frequency (%) |
| V | 534 | 32.5% |
| M | 534 | 32.5% |
| S | 400 | 24.3% |
| U | 59 | 3.6% |
| N | 59 | 3.6% |
| R | 52 | 3.2% |
| A | 7 | 0.4% |
Space Separator
| Value | Count | Frequency (%) |
| 600 | 100.0% |
Control
| Value | Count | Frequency (%) |
| 400 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8527 | 89.5% |
| Common | 1000 | 10.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1104 | 12.9% |
| h | 934 | 11.0% |
| c | 593 | 7.0% |
| y | 586 | 6.9% |
| V | 534 | 6.3% |
| r | 534 | 6.3% |
| M | 534 | 6.3% |
| u | 534 | 6.3% |
| t | 466 | 5.5% |
| a | 459 | 5.4% |
| Other values (12) | 2249 | 26.4% |
Common
| Value | Count | Frequency (%) |
| 600 | 60.0% |
| 400 | 40.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9527 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 1104 | 11.6% |
| h | 934 | 9.8% |
| 600 | 6.3% |
| c | 593 | 6.2% |
| y | 586 | 6.2% |
| V | 534 | 5.6% |
| r | 534 | 5.6% |
| M | 534 | 5.6% |
| u | 534 | 5.6% |
| t | 466 | 4.9% |
| Other values (14) | 3108 | 32.6% |
| Distinct | 5 |
|---|
| Distinct (%) | 0.5% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 8.3 KiB |
|---|
Length
| Max length | 10 |
|---|
| Median length | 9 |
|---|
| Mean length | 9.171102662 |
|---|
| Min length | 9 |
|---|
Characters and Unicode
| Total characters | 9648 |
|---|
| Distinct characters | 24 |
|---|
| Distinct categories | 4 ? |
|---|
| Distinct scripts | 2 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Sample
| 1st row | Somewhat |
|---|
| 2nd row | Somewhat |
|---|
| 3rd row | Somewhat |
|---|
| 4th row | Very Much |
|---|
| 5th row | Somewhat |
|---|
| Value | Count | Frequency (%) |
| somewhat | 407 | 25.7% |
| very | 312 | 19.7% |
| much | 312 | 19.7% |
| not | 180 | 11.4% |
| undecided | 153 | 9.7% |
| really | 142 | 9.0% |
| at | 38 | 2.4% |
| all | 38 | 2.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1167 | 12.1% |
| h | 719 | 7.5% |
| t | 625 | 6.5% |
| a | 587 | 6.1% |
| o | 587 | 6.1% |
| 530 | 5.5% |
| c | 465 | 4.8% |
| d | 459 | 4.8% |
| y | 454 | 4.7% |
| S | 407 | 4.2% |
| Other values (14) | 3648 | 37.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7167 | 74.3% |
| Uppercase Letter | 1544 | 16.0% |
| Space Separator | 530 | 5.5% |
| Control | 407 | 4.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1167 | 16.3% |
| h | 719 | 10.0% |
| t | 625 | 8.7% |
| a | 587 | 8.2% |
| o | 587 | 8.2% |
| c | 465 | 6.5% |
| d | 459 | 6.4% |
| y | 454 | 6.3% |
| w | 407 | 5.7% |
| m | 407 | 5.7% |
| Other values (5) | 1290 | 18.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 407 | 26.4% |
| V | 312 | 20.2% |
| M | 312 | 20.2% |
| N | 180 | 11.7% |
| U | 153 | 9.9% |
| R | 142 | 9.2% |
| A | 38 | 2.5% |
Space Separator
| Value | Count | Frequency (%) |
| 530 | 100.0% |
Control
| Value | Count | Frequency (%) |
| 407 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8711 | 90.3% |
| Common | 937 | 9.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1167 | 13.4% |
| h | 719 | 8.3% |
| t | 625 | 7.2% |
| a | 587 | 6.7% |
| o | 587 | 6.7% |
| c | 465 | 5.3% |
| d | 459 | 5.3% |
| y | 454 | 5.2% |
| S | 407 | 4.7% |
| w | 407 | 4.7% |
| Other values (12) | 2834 | 32.5% |
Common
| Value | Count | Frequency (%) |
| 530 | 56.6% |
| 407 | 43.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9648 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 1167 | 12.1% |
| h | 719 | 7.5% |
| t | 625 | 6.5% |
| a | 587 | 6.1% |
| o | 587 | 6.1% |
| 530 | 5.5% |
| c | 465 | 4.8% |
| d | 459 | 4.8% |
| y | 454 | 4.7% |
| S | 407 | 4.2% |
| Other values (14) | 3648 | 37.8% |
| Distinct | 5 |
|---|
| Distinct (%) | 0.5% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 8.3 KiB |
|---|
Length
| Max length | 10 |
|---|
| Median length | 9 |
|---|
| Mean length | 9.021863118 |
|---|
| Min length | 9 |
|---|
Characters and Unicode
| Total characters | 9491 |
|---|
| Distinct characters | 24 |
|---|
| Distinct categories | 4 ? |
|---|
| Distinct scripts | 2 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Sample
| 1st row | Somewhat |
|---|
| 2nd row | Somewhat |
|---|
| 3rd row | Very Much |
|---|
| 4th row | Very Much |
|---|
| 5th row | Very Much |
|---|
| Value | Count | Frequency (%) |
| very | 703 | 39.5% |
| much | 703 | 39.5% |
| somewhat | 271 | 15.2% |
| undecided | 55 | 3.1% |
| not | 23 | 1.3% |
| really | 22 | 1.2% |
| at | 1 | 0.1% |
| all | 1 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1106 | 11.7% |
| h | 974 | 10.3% |
| c | 758 | 8.0% |
| 727 | 7.7% |
| y | 725 | 7.6% |
| V | 703 | 7.4% |
| r | 703 | 7.4% |
| M | 703 | 7.4% |
| u | 703 | 7.4% |
| t | 295 | 3.1% |
| Other values (14) | 2094 | 22.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 6715 | 70.8% |
| Uppercase Letter | 1778 | 18.7% |
| Space Separator | 727 | 7.7% |
| Control | 271 | 2.9% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1106 | 16.5% |
| h | 974 | 14.5% |
| c | 758 | 11.3% |
| y | 725 | 10.8% |
| r | 703 | 10.5% |
| u | 703 | 10.5% |
| t | 295 | 4.4% |
| a | 294 | 4.4% |
| o | 294 | 4.4% |
| w | 271 | 4.0% |
| Other values (5) | 592 | 8.8% |
Uppercase Letter
| Value | Count | Frequency (%) |
| V | 703 | 39.5% |
| M | 703 | 39.5% |
| S | 271 | 15.2% |
| U | 55 | 3.1% |
| N | 23 | 1.3% |
| R | 22 | 1.2% |
| A | 1 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 727 | 100.0% |
Control
| Value | Count | Frequency (%) |
| 271 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8493 | 89.5% |
| Common | 998 | 10.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1106 | 13.0% |
| h | 974 | 11.5% |
| c | 758 | 8.9% |
| y | 725 | 8.5% |
| V | 703 | 8.3% |
| r | 703 | 8.3% |
| M | 703 | 8.3% |
| u | 703 | 8.3% |
| t | 295 | 3.5% |
| a | 294 | 3.5% |
| Other values (12) | 1529 | 18.0% |
Common
| Value | Count | Frequency (%) |
| 727 | 72.8% |
| 271 | 27.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9491 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 1106 | 11.7% |
| h | 974 | 10.3% |
| c | 758 | 8.0% |
| 727 | 7.7% |
| y | 725 | 7.6% |
| V | 703 | 7.4% |
| r | 703 | 7.4% |
| M | 703 | 7.4% |
| u | 703 | 7.4% |
| t | 295 | 3.1% |
| Other values (14) | 2094 | 22.1% |
| Distinct | 5 |
|---|
| Distinct (%) | 0.5% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 8.3 KiB |
|---|
Length
| Max length | 10 |
|---|
| Median length | 9 |
|---|
| Mean length | 9.136882129 |
|---|
| Min length | 9 |
|---|
Characters and Unicode
| Total characters | 9612 |
|---|
| Distinct characters | 24 |
|---|
| Distinct categories | 4 ? |
|---|
| Distinct scripts | 2 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Sample
| 1st row | Somewhat |
|---|
| 2nd row | Somewhat |
|---|
| 3rd row | Somewhat |
|---|
| 4th row | Somewhat |
|---|
| 5th row | Somewhat |
|---|
| Value | Count | Frequency (%) |
| very | 371 | 23.1% |
| much | 371 | 23.1% |
| somewhat | 359 | 22.4% |
| undecided | 178 | 11.1% |
| not | 144 | 9.0% |
| really | 105 | 6.5% |
| at | 39 | 2.4% |
| all | 39 | 2.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1191 | 12.4% |
| h | 730 | 7.6% |
| 554 | 5.8% |
| c | 549 | 5.7% |
| t | 542 | 5.6% |
| d | 534 | 5.6% |
| o | 503 | 5.2% |
| a | 503 | 5.2% |
| y | 476 | 5.0% |
| V | 371 | 3.9% |
| Other values (14) | 3659 | 38.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7132 | 74.2% |
| Uppercase Letter | 1567 | 16.3% |
| Space Separator | 554 | 5.8% |
| Control | 359 | 3.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1191 | 16.7% |
| h | 730 | 10.2% |
| c | 549 | 7.7% |
| t | 542 | 7.6% |
| d | 534 | 7.5% |
| o | 503 | 7.1% |
| a | 503 | 7.1% |
| y | 476 | 6.7% |
| r | 371 | 5.2% |
| u | 371 | 5.2% |
| Other values (5) | 1362 | 19.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| V | 371 | 23.7% |
| M | 371 | 23.7% |
| S | 359 | 22.9% |
| U | 178 | 11.4% |
| N | 144 | 9.2% |
| R | 105 | 6.7% |
| A | 39 | 2.5% |
Space Separator
| Value | Count | Frequency (%) |
| 554 | 100.0% |
Control
| Value | Count | Frequency (%) |
| 359 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8699 | 90.5% |
| Common | 913 | 9.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1191 | 13.7% |
| h | 730 | 8.4% |
| c | 549 | 6.3% |
| t | 542 | 6.2% |
| d | 534 | 6.1% |
| o | 503 | 5.8% |
| a | 503 | 5.8% |
| y | 476 | 5.5% |
| V | 371 | 4.3% |
| r | 371 | 4.3% |
| Other values (12) | 2929 | 33.7% |
Common
| Value | Count | Frequency (%) |
| 554 | 60.7% |
| 359 | 39.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9612 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 1191 | 12.4% |
| h | 730 | 7.6% |
| 554 | 5.8% |
| c | 549 | 5.7% |
| t | 542 | 5.6% |
| d | 534 | 5.6% |
| o | 503 | 5.2% |
| a | 503 | 5.2% |
| y | 476 | 5.0% |
| V | 371 | 3.9% |
| Other values (14) | 3659 | 38.1% |
| Distinct | 5 |
|---|
| Distinct (%) | 0.5% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 8.3 KiB |
|---|
Length
| Max length | 10 |
|---|
| Median length | 9 |
|---|
| Mean length | 9.036121673 |
|---|
| Min length | 9 |
|---|
Characters and Unicode
| Total characters | 9506 |
|---|
| Distinct characters | 24 |
|---|
| Distinct categories | 4 ? |
|---|
| Distinct scripts | 2 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Sample
| 1st row | Somewhat |
|---|
| 2nd row | Somewhat |
|---|
| 3rd row | Undecided |
|---|
| 4th row | Somewhat |
|---|
| 5th row | Somewhat |
|---|
| Value | Count | Frequency (%) |
| very | 504 | 31.5% |
| much | 504 | 31.5% |
| somewhat | 418 | 26.1% |
| undecided | 92 | 5.7% |
| not | 38 | 2.4% |
| really | 30 | 1.9% |
| at | 8 | 0.5% |
| all | 8 | 0.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1136 | 12.0% |
| h | 922 | 9.7% |
| c | 596 | 6.3% |
| 550 | 5.8% |
| y | 534 | 5.6% |
| V | 504 | 5.3% |
| r | 504 | 5.3% |
| M | 504 | 5.3% |
| u | 504 | 5.3% |
| t | 464 | 4.9% |
| Other values (14) | 3288 | 34.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 6944 | 73.0% |
| Uppercase Letter | 1594 | 16.8% |
| Space Separator | 550 | 5.8% |
| Control | 418 | 4.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1136 | 16.4% |
| h | 922 | 13.3% |
| c | 596 | 8.6% |
| y | 534 | 7.7% |
| r | 504 | 7.3% |
| u | 504 | 7.3% |
| t | 464 | 6.7% |
| a | 456 | 6.6% |
| o | 456 | 6.6% |
| w | 418 | 6.0% |
| Other values (5) | 954 | 13.7% |
Uppercase Letter
| Value | Count | Frequency (%) |
| V | 504 | 31.6% |
| M | 504 | 31.6% |
| S | 418 | 26.2% |
| U | 92 | 5.8% |
| N | 38 | 2.4% |
| R | 30 | 1.9% |
| A | 8 | 0.5% |
Space Separator
| Value | Count | Frequency (%) |
| 550 | 100.0% |
Control
| Value | Count | Frequency (%) |
| 418 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8538 | 89.8% |
| Common | 968 | 10.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1136 | 13.3% |
| h | 922 | 10.8% |
| c | 596 | 7.0% |
| y | 534 | 6.3% |
| V | 504 | 5.9% |
| r | 504 | 5.9% |
| M | 504 | 5.9% |
| u | 504 | 5.9% |
| t | 464 | 5.4% |
| a | 456 | 5.3% |
| Other values (12) | 2414 | 28.3% |
Common
| Value | Count | Frequency (%) |
| 550 | 56.8% |
| 418 | 43.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9506 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 1136 | 12.0% |
| h | 922 | 9.7% |
| c | 596 | 6.3% |
| 550 | 5.8% |
| y | 534 | 5.6% |
| V | 504 | 5.3% |
| r | 504 | 5.3% |
| M | 504 | 5.3% |
| u | 504 | 5.3% |
| t | 464 | 4.9% |
| Other values (14) | 3288 | 34.6% |
| Distinct | 5 |
|---|
| Distinct (%) | 0.5% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 8.3 KiB |
|---|
Length
| Max length | 10 |
|---|
| Median length | 9 |
|---|
| Mean length | 9.148288973 |
|---|
| Min length | 9 |
|---|
Characters and Unicode
| Total characters | 9624 |
|---|
| Distinct characters | 24 |
|---|
| Distinct categories | 4 ? |
|---|
| Distinct scripts | 2 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Sample
| 1st row | Somewhat |
|---|
| 2nd row | Somewhat |
|---|
| 3rd row | Somewhat |
|---|
| 4th row | Undecided |
|---|
| 5th row | Somewhat |
|---|
| Value | Count | Frequency (%) |
| somewhat | 371 | 23.0% |
| very | 363 | 22.5% |
| much | 363 | 22.5% |
| undecided | 162 | 10.0% |
| not | 156 | 9.7% |
| really | 111 | 6.9% |
| at | 45 | 2.8% |
| all | 45 | 2.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1169 | 12.1% |
| h | 734 | 7.6% |
| t | 572 | 5.9% |
| 564 | 5.9% |
| a | 527 | 5.5% |
| o | 527 | 5.5% |
| c | 525 | 5.5% |
| d | 486 | 5.0% |
| y | 474 | 4.9% |
| S | 371 | 3.9% |
| Other values (14) | 3675 | 38.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7118 | 74.0% |
| Uppercase Letter | 1571 | 16.3% |
| Space Separator | 564 | 5.9% |
| Control | 371 | 3.9% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1169 | 16.4% |
| h | 734 | 10.3% |
| t | 572 | 8.0% |
| a | 527 | 7.4% |
| o | 527 | 7.4% |
| c | 525 | 7.4% |
| d | 486 | 6.8% |
| y | 474 | 6.7% |
| w | 371 | 5.2% |
| m | 371 | 5.2% |
| Other values (5) | 1362 | 19.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 371 | 23.6% |
| V | 363 | 23.1% |
| M | 363 | 23.1% |
| U | 162 | 10.3% |
| N | 156 | 9.9% |
| R | 111 | 7.1% |
| A | 45 | 2.9% |
Space Separator
| Value | Count | Frequency (%) |
| 564 | 100.0% |
Control
| Value | Count | Frequency (%) |
| 371 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8689 | 90.3% |
| Common | 935 | 9.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1169 | 13.5% |
| h | 734 | 8.4% |
| t | 572 | 6.6% |
| a | 527 | 6.1% |
| o | 527 | 6.1% |
| c | 525 | 6.0% |
| d | 486 | 5.6% |
| y | 474 | 5.5% |
| S | 371 | 4.3% |
| w | 371 | 4.3% |
| Other values (12) | 2933 | 33.8% |
Common
| Value | Count | Frequency (%) |
| 564 | 60.3% |
| 371 | 39.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9624 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 1169 | 12.1% |
| h | 734 | 7.6% |
| t | 572 | 5.9% |
| 564 | 5.9% |
| a | 527 | 5.5% |
| o | 527 | 5.5% |
| c | 525 | 5.5% |
| d | 486 | 5.0% |
| y | 474 | 4.9% |
| S | 371 | 3.9% |
| Other values (14) | 3675 | 38.2% |
| Distinct | 5 |
|---|
| Distinct (%) | 0.5% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 8.3 KiB |
|---|
Length
| Max length | 10 |
|---|
| Median length | 9 |
|---|
| Mean length | 9.094106464 |
|---|
| Min length | 9 |
|---|
Characters and Unicode
| Total characters | 9567 |
|---|
| Distinct characters | 24 |
|---|
| Distinct categories | 4 ? |
|---|
| Distinct scripts | 2 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Sample
| 1st row | Somewhat |
|---|
| 2nd row | Somewhat |
|---|
| 3rd row | Somewhat |
|---|
| 4th row | Somewhat |
|---|
| 5th row | Very Much |
|---|
| Value | Count | Frequency (%) |
| very | 408 | 25.8% |
| much | 408 | 25.8% |
| somewhat | 377 | 23.8% |
| undecided | 168 | 10.6% |
| not | 99 | 6.2% |
| really | 74 | 4.7% |
| at | 25 | 1.6% |
| all | 25 | 1.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1195 | 12.5% |
| h | 785 | 8.2% |
| c | 576 | 6.0% |
| 532 | 5.6% |
| d | 504 | 5.3% |
| t | 501 | 5.2% |
| y | 482 | 5.0% |
| o | 476 | 5.0% |
| a | 476 | 5.0% |
| V | 408 | 4.3% |
| Other values (14) | 3632 | 38.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7099 | 74.2% |
| Uppercase Letter | 1559 | 16.3% |
| Space Separator | 532 | 5.6% |
| Control | 377 | 3.9% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1195 | 16.8% |
| h | 785 | 11.1% |
| c | 576 | 8.1% |
| d | 504 | 7.1% |
| t | 501 | 7.1% |
| y | 482 | 6.8% |
| o | 476 | 6.7% |
| a | 476 | 6.7% |
| r | 408 | 5.7% |
| u | 408 | 5.7% |
| Other values (5) | 1288 | 18.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| V | 408 | 26.2% |
| M | 408 | 26.2% |
| S | 377 | 24.2% |
| U | 168 | 10.8% |
| N | 99 | 6.4% |
| R | 74 | 4.7% |
| A | 25 | 1.6% |
Space Separator
| Value | Count | Frequency (%) |
| 532 | 100.0% |
Control
| Value | Count | Frequency (%) |
| 377 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8658 | 90.5% |
| Common | 909 | 9.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1195 | 13.8% |
| h | 785 | 9.1% |
| c | 576 | 6.7% |
| d | 504 | 5.8% |
| t | 501 | 5.8% |
| y | 482 | 5.6% |
| o | 476 | 5.5% |
| a | 476 | 5.5% |
| V | 408 | 4.7% |
| r | 408 | 4.7% |
| Other values (12) | 2847 | 32.9% |
Common
| Value | Count | Frequency (%) |
| 532 | 58.5% |
| 377 | 41.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9567 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 1195 | 12.5% |
| h | 785 | 8.2% |
| c | 576 | 6.0% |
| 532 | 5.6% |
| d | 504 | 5.3% |
| t | 501 | 5.2% |
| y | 482 | 5.0% |
| o | 476 | 5.0% |
| a | 476 | 5.0% |
| V | 408 | 4.3% |
| Other values (14) | 3632 | 38.0% |
| Distinct | 5 |
|---|
| Distinct (%) | 0.5% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 8.3 KiB |
|---|
Length
| Max length | 10 |
|---|
| Median length | 9 |
|---|
| Mean length | 9.085551331 |
|---|
| Min length | 9 |
|---|
Characters and Unicode
| Total characters | 9558 |
|---|
| Distinct characters | 24 |
|---|
| Distinct categories | 4 ? |
|---|
| Distinct scripts | 2 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Sample
| 1st row | Somewhat |
|---|
| 2nd row | Somewhat |
|---|
| 3rd row | Undecided |
|---|
| 4th row | Undecided |
|---|
| 5th row | Somewhat |
|---|
| Value | Count | Frequency (%) |
| very | 413 | 26.1% |
| much | 413 | 26.1% |
| somewhat | 362 | 22.9% |
| undecided | 187 | 11.8% |
| not | 90 | 5.7% |
| really | 61 | 3.9% |
| at | 29 | 1.8% |
| all | 29 | 1.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1210 | 12.7% |
| h | 775 | 8.1% |
| c | 600 | 6.3% |
| d | 561 | 5.9% |
| 532 | 5.6% |
| t | 481 | 5.0% |
| y | 474 | 5.0% |
| o | 452 | 4.7% |
| a | 452 | 4.7% |
| V | 413 | 4.3% |
| Other values (14) | 3608 | 37.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7109 | 74.4% |
| Uppercase Letter | 1555 | 16.3% |
| Space Separator | 532 | 5.6% |
| Control | 362 | 3.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1210 | 17.0% |
| h | 775 | 10.9% |
| c | 600 | 8.4% |
| d | 561 | 7.9% |
| t | 481 | 6.8% |
| y | 474 | 6.7% |
| o | 452 | 6.4% |
| a | 452 | 6.4% |
| r | 413 | 5.8% |
| u | 413 | 5.8% |
| Other values (5) | 1278 | 18.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| V | 413 | 26.6% |
| M | 413 | 26.6% |
| S | 362 | 23.3% |
| U | 187 | 12.0% |
| N | 90 | 5.8% |
| R | 61 | 3.9% |
| A | 29 | 1.9% |
Space Separator
| Value | Count | Frequency (%) |
| 532 | 100.0% |
Control
| Value | Count | Frequency (%) |
| 362 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8664 | 90.6% |
| Common | 894 | 9.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1210 | 14.0% |
| h | 775 | 8.9% |
| c | 600 | 6.9% |
| d | 561 | 6.5% |
| t | 481 | 5.6% |
| y | 474 | 5.5% |
| o | 452 | 5.2% |
| a | 452 | 5.2% |
| V | 413 | 4.8% |
| r | 413 | 4.8% |
| Other values (12) | 2833 | 32.7% |
Common
| Value | Count | Frequency (%) |
| 532 | 59.5% |
| 362 | 40.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9558 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 1210 | 12.7% |
| h | 775 | 8.1% |
| c | 600 | 6.3% |
| d | 561 | 5.9% |
| 532 | 5.6% |
| t | 481 | 5.0% |
| y | 474 | 5.0% |
| o | 452 | 4.7% |
| a | 452 | 4.7% |
| V | 413 | 4.3% |
| Other values (14) | 3608 | 37.7% |
| Distinct | 5 |
|---|
| Distinct (%) | 0.5% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 8.3 KiB |
|---|
Length
| Max length | 10 |
|---|
| Median length | 9 |
|---|
| Mean length | 9.095057034 |
|---|
| Min length | 9 |
|---|
Characters and Unicode
| Total characters | 9568 |
|---|
| Distinct characters | 24 |
|---|
| Distinct categories | 4 ? |
|---|
| Distinct scripts | 2 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Sample
| 1st row | Somewhat |
|---|
| 2nd row | Somewhat |
|---|
| 3rd row | Very Much |
|---|
| 4th row | Somewhat |
|---|
| 5th row | Very Much |
|---|
| Value | Count | Frequency (%) |
| somewhat | 388 | 25.0% |
| very | 382 | 24.6% |
| much | 382 | 24.6% |
| undecided | 182 | 11.7% |
| not | 100 | 6.4% |
| really | 79 | 5.1% |
| at | 21 | 1.4% |
| all | 21 | 1.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1213 | 12.7% |
| h | 770 | 8.0% |
| c | 564 | 5.9% |
| d | 546 | 5.7% |
| t | 509 | 5.3% |
| 503 | 5.3% |
| a | 488 | 5.1% |
| o | 488 | 5.1% |
| y | 461 | 4.8% |
| S | 388 | 4.1% |
| Other values (14) | 3638 | 38.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7143 | 74.7% |
| Uppercase Letter | 1534 | 16.0% |
| Space Separator | 503 | 5.3% |
| Control | 388 | 4.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1213 | 17.0% |
| h | 770 | 10.8% |
| c | 564 | 7.9% |
| d | 546 | 7.6% |
| t | 509 | 7.1% |
| a | 488 | 6.8% |
| o | 488 | 6.8% |
| y | 461 | 6.5% |
| w | 388 | 5.4% |
| m | 388 | 5.4% |
| Other values (5) | 1328 | 18.6% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 388 | 25.3% |
| V | 382 | 24.9% |
| M | 382 | 24.9% |
| U | 182 | 11.9% |
| N | 100 | 6.5% |
| R | 79 | 5.1% |
| A | 21 | 1.4% |
Space Separator
| Value | Count | Frequency (%) |
| 503 | 100.0% |
Control
| Value | Count | Frequency (%) |
| 388 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8677 | 90.7% |
| Common | 891 | 9.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1213 | 14.0% |
| h | 770 | 8.9% |
| c | 564 | 6.5% |
| d | 546 | 6.3% |
| t | 509 | 5.9% |
| a | 488 | 5.6% |
| o | 488 | 5.6% |
| y | 461 | 5.3% |
| S | 388 | 4.5% |
| w | 388 | 4.5% |
| Other values (12) | 2862 | 33.0% |
Common
| Value | Count | Frequency (%) |
| 503 | 56.5% |
| 388 | 43.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9568 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 1213 | 12.7% |
| h | 770 | 8.0% |
| c | 564 | 5.9% |
| d | 546 | 5.7% |
| t | 509 | 5.3% |
| 503 | 5.3% |
| a | 488 | 5.1% |
| o | 488 | 5.1% |
| y | 461 | 4.8% |
| S | 388 | 4.1% |
| Other values (14) | 3638 | 38.0% |
| Distinct | 5 |
|---|
| Distinct (%) | 0.5% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 8.3 KiB |
|---|
Length
| Max length | 10 |
|---|
| Median length | 9 |
|---|
| Mean length | 9.049429658 |
|---|
| Min length | 9 |
|---|
Characters and Unicode
| Total characters | 9520 |
|---|
| Distinct characters | 24 |
|---|
| Distinct categories | 4 ? |
|---|
| Distinct scripts | 2 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Sample
| 1st row | Somewhat |
|---|
| 2nd row | Somewhat |
|---|
| 3rd row | Somewhat |
|---|
| 4th row | Very Much |
|---|
| 5th row | Somewhat |
|---|
| Value | Count | Frequency (%) |
| very | 449 | 28.7% |
| much | 449 | 28.7% |
| somewhat | 425 | 27.1% |
| undecided | 126 | 8.0% |
| not | 52 | 3.3% |
| really | 38 | 2.4% |
| at | 14 | 0.9% |
| all | 14 | 0.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1164 | 12.2% |
| h | 874 | 9.2% |
| c | 575 | 6.0% |
| 515 | 5.4% |
| t | 491 | 5.2% |
| y | 487 | 5.1% |
| a | 477 | 5.0% |
| o | 477 | 5.0% |
| V | 449 | 4.7% |
| r | 449 | 4.7% |
| Other values (14) | 3562 | 37.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7027 | 73.8% |
| Uppercase Letter | 1553 | 16.3% |
| Space Separator | 515 | 5.4% |
| Control | 425 | 4.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1164 | 16.6% |
| h | 874 | 12.4% |
| c | 575 | 8.2% |
| t | 491 | 7.0% |
| y | 487 | 6.9% |
| a | 477 | 6.8% |
| o | 477 | 6.8% |
| r | 449 | 6.4% |
| u | 449 | 6.4% |
| w | 425 | 6.0% |
| Other values (5) | 1159 | 16.5% |
Uppercase Letter
| Value | Count | Frequency (%) |
| V | 449 | 28.9% |
| M | 449 | 28.9% |
| S | 425 | 27.4% |
| U | 126 | 8.1% |
| N | 52 | 3.3% |
| R | 38 | 2.4% |
| A | 14 | 0.9% |
Space Separator
| Value | Count | Frequency (%) |
| 515 | 100.0% |
Control
| Value | Count | Frequency (%) |
| 425 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8580 | 90.1% |
| Common | 940 | 9.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1164 | 13.6% |
| h | 874 | 10.2% |
| c | 575 | 6.7% |
| t | 491 | 5.7% |
| y | 487 | 5.7% |
| a | 477 | 5.6% |
| o | 477 | 5.6% |
| V | 449 | 5.2% |
| r | 449 | 5.2% |
| M | 449 | 5.2% |
| Other values (12) | 2688 | 31.3% |
Common
| Value | Count | Frequency (%) |
| 515 | 54.8% |
| 425 | 45.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9520 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 1164 | 12.2% |
| h | 874 | 9.2% |
| c | 575 | 6.0% |
| 515 | 5.4% |
| t | 491 | 5.2% |
| y | 487 | 5.1% |
| a | 477 | 5.0% |
| o | 477 | 5.0% |
| V | 449 | 4.7% |
| r | 449 | 4.7% |
| Other values (14) | 3562 | 37.4% |
| Distinct | 5 |
|---|
| Distinct (%) | 0.5% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 8.3 KiB |
|---|
Length
| Max length | 10 |
|---|
| Median length | 9 |
|---|
| Mean length | 9.057034221 |
|---|
| Min length | 9 |
|---|
Characters and Unicode
| Total characters | 9528 |
|---|
| Distinct characters | 24 |
|---|
| Distinct categories | 4 ? |
|---|
| Distinct scripts | 2 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Sample
| 1st row | Somewhat |
|---|
| 2nd row | Somewhat |
|---|
| 3rd row | Very Much |
|---|
| 4th row | Very Much |
|---|
| 5th row | Somewhat |
|---|
| Value | Count | Frequency (%) |
| very | 455 | 28.8% |
| much | 455 | 28.8% |
| somewhat | 396 | 25.1% |
| undecided | 141 | 8.9% |
| not | 60 | 3.8% |
| really | 47 | 3.0% |
| at | 13 | 0.8% |
| all | 13 | 0.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1180 | 12.4% |
| h | 851 | 8.9% |
| c | 596 | 6.3% |
| 528 | 5.5% |
| y | 502 | 5.3% |
| t | 469 | 4.9% |
| o | 456 | 4.8% |
| a | 456 | 4.8% |
| V | 455 | 4.8% |
| r | 455 | 4.8% |
| Other values (14) | 3580 | 37.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7037 | 73.9% |
| Uppercase Letter | 1567 | 16.4% |
| Space Separator | 528 | 5.5% |
| Control | 396 | 4.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1180 | 16.8% |
| h | 851 | 12.1% |
| c | 596 | 8.5% |
| y | 502 | 7.1% |
| t | 469 | 6.7% |
| o | 456 | 6.5% |
| a | 456 | 6.5% |
| r | 455 | 6.5% |
| u | 455 | 6.5% |
| d | 423 | 6.0% |
| Other values (5) | 1194 | 17.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| V | 455 | 29.0% |
| M | 455 | 29.0% |
| S | 396 | 25.3% |
| U | 141 | 9.0% |
| N | 60 | 3.8% |
| R | 47 | 3.0% |
| A | 13 | 0.8% |
Space Separator
| Value | Count | Frequency (%) |
| 528 | 100.0% |
Control
| Value | Count | Frequency (%) |
| 396 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8604 | 90.3% |
| Common | 924 | 9.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1180 | 13.7% |
| h | 851 | 9.9% |
| c | 596 | 6.9% |
| y | 502 | 5.8% |
| t | 469 | 5.5% |
| o | 456 | 5.3% |
| a | 456 | 5.3% |
| V | 455 | 5.3% |
| r | 455 | 5.3% |
| M | 455 | 5.3% |
| Other values (12) | 2729 | 31.7% |
Common
| Value | Count | Frequency (%) |
| 528 | 57.1% |
| 396 | 42.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9528 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 1180 | 12.4% |
| h | 851 | 8.9% |
| c | 596 | 6.3% |
| 528 | 5.5% |
| y | 502 | 5.3% |
| t | 469 | 4.9% |
| o | 456 | 4.8% |
| a | 456 | 4.8% |
| V | 455 | 4.8% |
| r | 455 | 4.8% |
| Other values (14) | 3580 | 37.6% |
| Distinct | 5 |
|---|
| Distinct (%) | 0.5% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 8.3 KiB |
|---|
Length
| Max length | 10 |
|---|
| Median length | 9 |
|---|
| Mean length | 9.062737643 |
|---|
| Min length | 9 |
|---|
Characters and Unicode
| Total characters | 9534 |
|---|
| Distinct characters | 24 |
|---|
| Distinct categories | 4 ? |
|---|
| Distinct scripts | 2 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Sample
| 1st row | Somewhat |
|---|
| 2nd row | Somewhat |
|---|
| 3rd row | Very Much |
|---|
| 4th row | Very Much |
|---|
| 5th row | Somewhat |
|---|
| Value | Count | Frequency (%) |
| very | 450 | 28.4% |
| much | 450 | 28.4% |
| somewhat | 404 | 25.5% |
| undecided | 132 | 8.3% |
| not | 66 | 4.2% |
| really | 47 | 3.0% |
| at | 19 | 1.2% |
| all | 19 | 1.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1165 | 12.2% |
| h | 854 | 9.0% |
| c | 582 | 6.1% |
| 535 | 5.6% |
| y | 497 | 5.2% |
| t | 489 | 5.1% |
| a | 470 | 4.9% |
| o | 470 | 4.9% |
| V | 450 | 4.7% |
| r | 450 | 4.7% |
| Other values (14) | 3572 | 37.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7027 | 73.7% |
| Uppercase Letter | 1568 | 16.4% |
| Space Separator | 535 | 5.6% |
| Control | 404 | 4.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1165 | 16.6% |
| h | 854 | 12.2% |
| c | 582 | 8.3% |
| y | 497 | 7.1% |
| t | 489 | 7.0% |
| a | 470 | 6.7% |
| o | 470 | 6.7% |
| r | 450 | 6.4% |
| u | 450 | 6.4% |
| w | 404 | 5.7% |
| Other values (5) | 1196 | 17.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| V | 450 | 28.7% |
| M | 450 | 28.7% |
| S | 404 | 25.8% |
| U | 132 | 8.4% |
| N | 66 | 4.2% |
| R | 47 | 3.0% |
| A | 19 | 1.2% |
Space Separator
| Value | Count | Frequency (%) |
| 535 | 100.0% |
Control
| Value | Count | Frequency (%) |
| 404 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8595 | 90.2% |
| Common | 939 | 9.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1165 | 13.6% |
| h | 854 | 9.9% |
| c | 582 | 6.8% |
| y | 497 | 5.8% |
| t | 489 | 5.7% |
| a | 470 | 5.5% |
| o | 470 | 5.5% |
| V | 450 | 5.2% |
| r | 450 | 5.2% |
| M | 450 | 5.2% |
| Other values (12) | 2718 | 31.6% |
Common
| Value | Count | Frequency (%) |
| 535 | 57.0% |
| 404 | 43.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9534 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 1165 | 12.2% |
| h | 854 | 9.0% |
| c | 582 | 6.1% |
| 535 | 5.6% |
| y | 497 | 5.2% |
| t | 489 | 5.1% |
| a | 470 | 4.9% |
| o | 470 | 4.9% |
| V | 450 | 4.7% |
| r | 450 | 4.7% |
| Other values (14) | 3572 | 37.5% |
| Distinct | 5 |
|---|
| Distinct (%) | 0.5% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 8.3 KiB |
|---|
Length
| Max length | 10 |
|---|
| Median length | 9 |
|---|
| Mean length | 9.062737643 |
|---|
| Min length | 9 |
|---|
Characters and Unicode
| Total characters | 9534 |
|---|
| Distinct characters | 24 |
|---|
| Distinct categories | 4 ? |
|---|
| Distinct scripts | 2 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Sample
| 1st row | Somewhat |
|---|
| 2nd row | Somewhat |
|---|
| 3rd row | Somewhat |
|---|
| 4th row | Very Much |
|---|
| 5th row | Somewhat |
|---|
| Value | Count | Frequency (%) |
| somewhat | 417 | 26.8% |
| very | 417 | 26.8% |
| much | 417 | 26.8% |
| undecided | 152 | 9.8% |
| not | 66 | 4.2% |
| really | 45 | 2.9% |
| at | 21 | 1.3% |
| all | 21 | 1.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1183 | 12.4% |
| h | 834 | 8.7% |
| c | 569 | 6.0% |
| 504 | 5.3% |
| t | 504 | 5.3% |
| a | 483 | 5.1% |
| o | 483 | 5.1% |
| y | 462 | 4.8% |
| d | 456 | 4.8% |
| u | 417 | 4.4% |
| Other values (14) | 3639 | 38.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7078 | 74.2% |
| Uppercase Letter | 1535 | 16.1% |
| Space Separator | 504 | 5.3% |
| Control | 417 | 4.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1183 | 16.7% |
| h | 834 | 11.8% |
| c | 569 | 8.0% |
| t | 504 | 7.1% |
| a | 483 | 6.8% |
| o | 483 | 6.8% |
| y | 462 | 6.5% |
| d | 456 | 6.4% |
| u | 417 | 5.9% |
| r | 417 | 5.9% |
| Other values (5) | 1270 | 17.9% |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 417 | 27.2% |
| S | 417 | 27.2% |
| V | 417 | 27.2% |
| U | 152 | 9.9% |
| N | 66 | 4.3% |
| R | 45 | 2.9% |
| A | 21 | 1.4% |
Space Separator
| Value | Count | Frequency (%) |
| 504 | 100.0% |
Control
| Value | Count | Frequency (%) |
| 417 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8613 | 90.3% |
| Common | 921 | 9.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1183 | 13.7% |
| h | 834 | 9.7% |
| c | 569 | 6.6% |
| t | 504 | 5.9% |
| a | 483 | 5.6% |
| o | 483 | 5.6% |
| y | 462 | 5.4% |
| d | 456 | 5.3% |
| u | 417 | 4.8% |
| M | 417 | 4.8% |
| Other values (12) | 2805 | 32.6% |
Common
| Value | Count | Frequency (%) |
| 504 | 54.7% |
| 417 | 45.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9534 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 1183 | 12.4% |
| h | 834 | 8.7% |
| c | 569 | 6.0% |
| 504 | 5.3% |
| t | 504 | 5.3% |
| a | 483 | 5.1% |
| o | 483 | 5.1% |
| y | 462 | 4.8% |
| d | 456 | 4.8% |
| u | 417 | 4.4% |
| Other values (14) | 3639 | 38.2% |
| Distinct | 5 |
|---|
| Distinct (%) | 0.5% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 8.3 KiB |
|---|
Length
| Max length | 10 |
|---|
| Median length | 9 |
|---|
| Mean length | 8.739543726 |
|---|
| Min length | 8 |
|---|
Characters and Unicode
| Total characters | 9194 |
|---|
| Distinct characters | 23 |
|---|
| Distinct categories | 3 ? |
|---|
| Distinct scripts | 2 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Sample
| 1st row | Somewhat |
|---|
| 2nd row | Somewhat |
|---|
| 3rd row | Somewhat |
|---|
| 4th row | Very Much |
|---|
| 5th row | Somewhat |
|---|
| Value | Count | Frequency (%) |
| very | 546 | 32.3% |
| much | 546 | 32.3% |
| somewhat | 353 | 20.9% |
| not | 79 | 4.7% |
| undecided | 74 | 4.4% |
| really | 64 | 3.8% |
| at | 15 | 0.9% |
| all | 15 | 0.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1111 | 12.1% |
| h | 899 | 9.8% |
| 640 | 7.0% |
| c | 620 | 6.7% |
| y | 610 | 6.6% |
| V | 546 | 5.9% |
| r | 546 | 5.9% |
| M | 546 | 5.9% |
| u | 546 | 5.9% |
| t | 447 | 4.9% |
| Other values (13) | 2683 | 29.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 6877 | 74.8% |
| Uppercase Letter | 1677 | 18.2% |
| Space Separator | 640 | 7.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1111 | 16.2% |
| h | 899 | 13.1% |
| c | 620 | 9.0% |
| y | 610 | 8.9% |
| r | 546 | 7.9% |
| u | 546 | 7.9% |
| t | 447 | 6.5% |
| a | 432 | 6.3% |
| o | 432 | 6.3% |
| m | 353 | 5.1% |
| Other values (5) | 881 | 12.8% |
Uppercase Letter
| Value | Count | Frequency (%) |
| V | 546 | 32.6% |
| M | 546 | 32.6% |
| S | 353 | 21.0% |
| N | 79 | 4.7% |
| U | 74 | 4.4% |
| R | 64 | 3.8% |
| A | 15 | 0.9% |
Space Separator
| Value | Count | Frequency (%) |
| 640 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8554 | 93.0% |
| Common | 640 | 7.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1111 | 13.0% |
| h | 899 | 10.5% |
| c | 620 | 7.2% |
| y | 610 | 7.1% |
| V | 546 | 6.4% |
| r | 546 | 6.4% |
| M | 546 | 6.4% |
| u | 546 | 6.4% |
| t | 447 | 5.2% |
| a | 432 | 5.1% |
| Other values (12) | 2251 | 26.3% |
Common
| Value | Count | Frequency (%) |
| 640 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9194 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 1111 | 12.1% |
| h | 899 | 9.8% |
| 640 | 7.0% |
| c | 620 | 6.7% |
| y | 610 | 6.6% |
| V | 546 | 5.9% |
| r | 546 | 5.9% |
| M | 546 | 5.9% |
| u | 546 | 5.9% |
| t | 447 | 4.9% |
| Other values (13) | 2683 | 29.2% |
| Distinct | 5 |
|---|
| Distinct (%) | 0.5% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 8.3 KiB |
|---|
Length
| Max length | 10 |
|---|
| Median length | 9 |
|---|
| Mean length | 8.715779468 |
|---|
| Min length | 8 |
|---|
Characters and Unicode
| Total characters | 9169 |
|---|
| Distinct characters | 23 |
|---|
| Distinct categories | 3 ? |
|---|
| Distinct scripts | 2 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Sample
| 1st row | Somewhat |
|---|
| 2nd row | Somewhat |
|---|
| 3rd row | Somewhat |
|---|
| 4th row | Not Really |
|---|
| 5th row | Somewhat |
|---|
| Value | Count | Frequency (%) |
| very | 425 | 26.4% |
| much | 425 | 26.4% |
| somewhat | 404 | 25.1% |
| undecided | 118 | 7.3% |
| not | 105 | 6.5% |
| really | 79 | 4.9% |
| at | 26 | 1.6% |
| all | 26 | 1.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1144 | 12.5% |
| h | 829 | 9.0% |
| 556 | 6.1% |
| c | 543 | 5.9% |
| t | 535 | 5.8% |
| a | 509 | 5.6% |
| o | 509 | 5.6% |
| y | 504 | 5.5% |
| V | 425 | 4.6% |
| r | 425 | 4.6% |
| Other values (13) | 3190 | 34.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7031 | 76.7% |
| Uppercase Letter | 1582 | 17.3% |
| Space Separator | 556 | 6.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1144 | 16.3% |
| h | 829 | 11.8% |
| c | 543 | 7.7% |
| t | 535 | 7.6% |
| a | 509 | 7.2% |
| o | 509 | 7.2% |
| y | 504 | 7.2% |
| r | 425 | 6.0% |
| u | 425 | 6.0% |
| m | 404 | 5.7% |
| Other values (5) | 1204 | 17.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| V | 425 | 26.9% |
| M | 425 | 26.9% |
| S | 404 | 25.5% |
| U | 118 | 7.5% |
| N | 105 | 6.6% |
| R | 79 | 5.0% |
| A | 26 | 1.6% |
Space Separator
| Value | Count | Frequency (%) |
| 556 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8613 | 93.9% |
| Common | 556 | 6.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1144 | 13.3% |
| h | 829 | 9.6% |
| c | 543 | 6.3% |
| t | 535 | 6.2% |
| a | 509 | 5.9% |
| o | 509 | 5.9% |
| y | 504 | 5.9% |
| V | 425 | 4.9% |
| r | 425 | 4.9% |
| M | 425 | 4.9% |
| Other values (12) | 2765 | 32.1% |
Common
| Value | Count | Frequency (%) |
| 556 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9169 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 1144 | 12.5% |
| h | 829 | 9.0% |
| 556 | 6.1% |
| c | 543 | 5.9% |
| t | 535 | 5.8% |
| a | 509 | 5.6% |
| o | 509 | 5.6% |
| y | 504 | 5.5% |
| V | 425 | 4.6% |
| r | 425 | 4.6% |
| Other values (13) | 3190 | 34.8% |
| Distinct | 5 |
|---|
| Distinct (%) | 0.5% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 8.3 KiB |
|---|
Length
| Max length | 10 |
|---|
| Median length | 9 |
|---|
| Mean length | 8.741444867 |
|---|
| Min length | 8 |
|---|
Characters and Unicode
| Total characters | 9196 |
|---|
| Distinct characters | 23 |
|---|
| Distinct categories | 3 ? |
|---|
| Distinct scripts | 2 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Sample
| 1st row | Somewhat |
|---|
| 2nd row | Somewhat |
|---|
| 3rd row | Undecided |
|---|
| 4th row | Not Really |
|---|
| 5th row | Very Much |
|---|
| Value | Count | Frequency (%) |
| very | 433 | 26.6% |
| much | 433 | 26.6% |
| somewhat | 379 | 23.3% |
| undecided | 133 | 8.2% |
| not | 107 | 6.6% |
| really | 74 | 4.6% |
| at | 33 | 2.0% |
| all | 33 | 2.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1152 | 12.5% |
| h | 812 | 8.8% |
| 573 | 6.2% |
| c | 566 | 6.2% |
| t | 519 | 5.6% |
| y | 507 | 5.5% |
| a | 486 | 5.3% |
| o | 486 | 5.3% |
| V | 433 | 4.7% |
| r | 433 | 4.7% |
| Other values (13) | 3229 | 35.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7031 | 76.5% |
| Uppercase Letter | 1592 | 17.3% |
| Space Separator | 573 | 6.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1152 | 16.4% |
| h | 812 | 11.5% |
| c | 566 | 8.1% |
| t | 519 | 7.4% |
| y | 507 | 7.2% |
| a | 486 | 6.9% |
| o | 486 | 6.9% |
| r | 433 | 6.2% |
| u | 433 | 6.2% |
| d | 399 | 5.7% |
| Other values (5) | 1238 | 17.6% |
Uppercase Letter
| Value | Count | Frequency (%) |
| V | 433 | 27.2% |
| M | 433 | 27.2% |
| S | 379 | 23.8% |
| U | 133 | 8.4% |
| N | 107 | 6.7% |
| R | 74 | 4.6% |
| A | 33 | 2.1% |
Space Separator
| Value | Count | Frequency (%) |
| 573 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8623 | 93.8% |
| Common | 573 | 6.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1152 | 13.4% |
| h | 812 | 9.4% |
| c | 566 | 6.6% |
| t | 519 | 6.0% |
| y | 507 | 5.9% |
| a | 486 | 5.6% |
| o | 486 | 5.6% |
| V | 433 | 5.0% |
| r | 433 | 5.0% |
| M | 433 | 5.0% |
| Other values (12) | 2796 | 32.4% |
Common
| Value | Count | Frequency (%) |
| 573 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9196 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 1152 | 12.5% |
| h | 812 | 8.8% |
| 573 | 6.2% |
| c | 566 | 6.2% |
| t | 519 | 5.6% |
| y | 507 | 5.5% |
| a | 486 | 5.3% |
| o | 486 | 5.3% |
| V | 433 | 4.7% |
| r | 433 | 4.7% |
| Other values (13) | 3229 | 35.1% |
| Distinct | 5 |
|---|
| Distinct (%) | 0.5% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 8.3 KiB |
|---|
Length
| Max length | 10 |
|---|
| Median length | 9 |
|---|
| Mean length | 8.689163498 |
|---|
| Min length | 8 |
|---|
Characters and Unicode
| Total characters | 9141 |
|---|
| Distinct characters | 23 |
|---|
| Distinct categories | 3 ? |
|---|
| Distinct scripts | 2 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Sample
| 1st row | Somewhat |
|---|
| 2nd row | Somewhat |
|---|
| 3rd row | Very Much |
|---|
| 4th row | Somewhat |
|---|
| 5th row | Very Much |
|---|
| Value | Count | Frequency (%) |
| very | 537 | 32.7% |
| much | 537 | 32.7% |
| somewhat | 368 | 22.4% |
| undecided | 106 | 6.5% |
| not | 41 | 2.5% |
| really | 30 | 1.8% |
| at | 11 | 0.7% |
| all | 11 | 0.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1147 | 12.5% |
| h | 905 | 9.9% |
| c | 643 | 7.0% |
| 589 | 6.4% |
| y | 567 | 6.2% |
| V | 537 | 5.9% |
| r | 537 | 5.9% |
| M | 537 | 5.9% |
| u | 537 | 5.9% |
| t | 420 | 4.6% |
| Other values (13) | 2722 | 29.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 6922 | 75.7% |
| Uppercase Letter | 1630 | 17.8% |
| Space Separator | 589 | 6.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1147 | 16.6% |
| h | 905 | 13.1% |
| c | 643 | 9.3% |
| y | 567 | 8.2% |
| r | 537 | 7.8% |
| u | 537 | 7.8% |
| t | 420 | 6.1% |
| a | 409 | 5.9% |
| o | 409 | 5.9% |
| m | 368 | 5.3% |
| Other values (5) | 980 | 14.2% |
Uppercase Letter
| Value | Count | Frequency (%) |
| V | 537 | 32.9% |
| M | 537 | 32.9% |
| S | 368 | 22.6% |
| U | 106 | 6.5% |
| N | 41 | 2.5% |
| R | 30 | 1.8% |
| A | 11 | 0.7% |
Space Separator
| Value | Count | Frequency (%) |
| 589 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8552 | 93.6% |
| Common | 589 | 6.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1147 | 13.4% |
| h | 905 | 10.6% |
| c | 643 | 7.5% |
| y | 567 | 6.6% |
| V | 537 | 6.3% |
| r | 537 | 6.3% |
| M | 537 | 6.3% |
| u | 537 | 6.3% |
| t | 420 | 4.9% |
| a | 409 | 4.8% |
| Other values (12) | 2313 | 27.0% |
Common
| Value | Count | Frequency (%) |
| 589 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9141 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 1147 | 12.5% |
| h | 905 | 9.9% |
| c | 643 | 7.0% |
| 589 | 6.4% |
| y | 567 | 6.2% |
| V | 537 | 5.9% |
| r | 537 | 5.9% |
| M | 537 | 5.9% |
| u | 537 | 5.9% |
| t | 420 | 4.6% |
| Other values (13) | 2722 | 29.8% |
| Distinct | 5 |
|---|
| Distinct (%) | 0.5% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 8.3 KiB |
|---|
Length
| Max length | 10 |
|---|
| Median length | 9 |
|---|
| Mean length | 8.673954373 |
|---|
| Min length | 8 |
|---|
Characters and Unicode
| Total characters | 9125 |
|---|
| Distinct characters | 23 |
|---|
| Distinct categories | 3 ? |
|---|
| Distinct scripts | 2 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Sample
| 1st row | Somewhat |
|---|
| 2nd row | Somewhat |
|---|
| 3rd row | Very Much |
|---|
| 4th row | Not Really |
|---|
| 5th row | Somewhat |
|---|
| Value | Count | Frequency (%) |
| very | 460 | 29.1% |
| much | 460 | 29.1% |
| somewhat | 399 | 25.3% |
| undecided | 137 | 8.7% |
| not | 56 | 3.5% |
| really | 45 | 2.8% |
| at | 11 | 0.7% |
| all | 11 | 0.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1178 | 12.9% |
| h | 859 | 9.4% |
| c | 597 | 6.5% |
| 527 | 5.8% |
| y | 505 | 5.5% |
| t | 466 | 5.1% |
| V | 460 | 5.0% |
| r | 460 | 5.0% |
| M | 460 | 5.0% |
| u | 460 | 5.0% |
| Other values (13) | 3153 | 34.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7030 | 77.0% |
| Uppercase Letter | 1568 | 17.2% |
| Space Separator | 527 | 5.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1178 | 16.8% |
| h | 859 | 12.2% |
| c | 597 | 8.5% |
| y | 505 | 7.2% |
| t | 466 | 6.6% |
| r | 460 | 6.5% |
| u | 460 | 6.5% |
| a | 455 | 6.5% |
| o | 455 | 6.5% |
| d | 411 | 5.8% |
| Other values (5) | 1184 | 16.8% |
Uppercase Letter
| Value | Count | Frequency (%) |
| V | 460 | 29.3% |
| M | 460 | 29.3% |
| S | 399 | 25.4% |
| U | 137 | 8.7% |
| N | 56 | 3.6% |
| R | 45 | 2.9% |
| A | 11 | 0.7% |
Space Separator
| Value | Count | Frequency (%) |
| 527 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8598 | 94.2% |
| Common | 527 | 5.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1178 | 13.7% |
| h | 859 | 10.0% |
| c | 597 | 6.9% |
| y | 505 | 5.9% |
| t | 466 | 5.4% |
| V | 460 | 5.4% |
| r | 460 | 5.4% |
| M | 460 | 5.4% |
| u | 460 | 5.4% |
| a | 455 | 5.3% |
| Other values (12) | 2698 | 31.4% |
Common
| Value | Count | Frequency (%) |
| 527 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9125 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 1178 | 12.9% |
| h | 859 | 9.4% |
| c | 597 | 6.5% |
| 527 | 5.8% |
| y | 505 | 5.5% |
| t | 466 | 5.1% |
| V | 460 | 5.0% |
| r | 460 | 5.0% |
| M | 460 | 5.0% |
| u | 460 | 5.0% |
| Other values (13) | 3153 | 34.6% |
| Distinct | 5 |
|---|
| Distinct (%) | 0.5% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 8.3 KiB |
|---|
Length
| Max length | 10 |
|---|
| Median length | 9 |
|---|
| Mean length | 8.682509506 |
|---|
| Min length | 8 |
|---|
Characters and Unicode
| Total characters | 9134 |
|---|
| Distinct characters | 23 |
|---|
| Distinct categories | 3 ? |
|---|
| Distinct scripts | 2 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Sample
| 1st row | Somewhat |
|---|
| 2nd row | Somewhat |
|---|
| 3rd row | Very Much |
|---|
| 4th row | Very Much |
|---|
| 5th row | Somewhat |
|---|
| Value | Count | Frequency (%) |
| very | 525 | 32.0% |
| much | 525 | 32.0% |
| somewhat | 382 | 23.3% |
| undecided | 97 | 5.9% |
| not | 48 | 2.9% |
| really | 33 | 2.0% |
| at | 15 | 0.9% |
| all | 15 | 0.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1134 | 12.4% |
| h | 907 | 9.9% |
| c | 622 | 6.8% |
| 588 | 6.4% |
| y | 558 | 6.1% |
| V | 525 | 5.7% |
| r | 525 | 5.7% |
| M | 525 | 5.7% |
| u | 525 | 5.7% |
| t | 445 | 4.9% |
| Other values (13) | 2780 | 30.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 6921 | 75.8% |
| Uppercase Letter | 1625 | 17.8% |
| Space Separator | 588 | 6.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1134 | 16.4% |
| h | 907 | 13.1% |
| c | 622 | 9.0% |
| y | 558 | 8.1% |
| r | 525 | 7.6% |
| u | 525 | 7.6% |
| t | 445 | 6.4% |
| a | 430 | 6.2% |
| o | 430 | 6.2% |
| m | 382 | 5.5% |
| Other values (5) | 963 | 13.9% |
Uppercase Letter
| Value | Count | Frequency (%) |
| V | 525 | 32.3% |
| M | 525 | 32.3% |
| S | 382 | 23.5% |
| U | 97 | 6.0% |
| N | 48 | 3.0% |
| R | 33 | 2.0% |
| A | 15 | 0.9% |
Space Separator
| Value | Count | Frequency (%) |
| 588 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8546 | 93.6% |
| Common | 588 | 6.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1134 | 13.3% |
| h | 907 | 10.6% |
| c | 622 | 7.3% |
| y | 558 | 6.5% |
| V | 525 | 6.1% |
| r | 525 | 6.1% |
| M | 525 | 6.1% |
| u | 525 | 6.1% |
| t | 445 | 5.2% |
| a | 430 | 5.0% |
| Other values (12) | 2350 | 27.5% |
Common
| Value | Count | Frequency (%) |
| 588 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9134 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 1134 | 12.4% |
| h | 907 | 9.9% |
| c | 622 | 6.8% |
| 588 | 6.4% |
| y | 558 | 6.1% |
| V | 525 | 5.7% |
| r | 525 | 5.7% |
| M | 525 | 5.7% |
| u | 525 | 5.7% |
| t | 445 | 4.9% |
| Other values (13) | 2780 | 30.4% |
| Distinct | 5 |
|---|
| Distinct (%) | 0.5% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 8.3 KiB |
|---|
Length
| Max length | 10 |
|---|
| Median length | 9 |
|---|
| Mean length | 8.706273764 |
|---|
| Min length | 8 |
|---|
Characters and Unicode
| Total characters | 9159 |
|---|
| Distinct characters | 23 |
|---|
| Distinct categories | 3 ? |
|---|
| Distinct scripts | 2 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Sample
| 1st row | Somewhat |
|---|
| 2nd row | Somewhat |
|---|
| 3rd row | Very Much |
|---|
| 4th row | Somewhat |
|---|
| 5th row | Somewhat |
|---|
| Value | Count | Frequency (%) |
| somewhat | 444 | 30.2% |
| very | 246 | 16.8% |
| much | 246 | 16.8% |
| undecided | 227 | 15.5% |
| not | 135 | 9.2% |
| really | 100 | 6.8% |
| at | 35 | 2.4% |
| all | 35 | 2.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1244 | 13.6% |
| h | 690 | 7.5% |
| d | 681 | 7.4% |
| t | 614 | 6.7% |
| o | 579 | 6.3% |
| a | 579 | 6.3% |
| c | 473 | 5.2% |
| S | 444 | 4.8% |
| m | 444 | 4.8% |
| w | 444 | 4.8% |
| Other values (13) | 2967 | 32.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7310 | 79.8% |
| Uppercase Letter | 1433 | 15.6% |
| Space Separator | 416 | 4.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1244 | 17.0% |
| h | 690 | 9.4% |
| d | 681 | 9.3% |
| t | 614 | 8.4% |
| o | 579 | 7.9% |
| a | 579 | 7.9% |
| c | 473 | 6.5% |
| m | 444 | 6.1% |
| w | 444 | 6.1% |
| y | 346 | 4.7% |
| Other values (5) | 1216 | 16.6% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 444 | 31.0% |
| M | 246 | 17.2% |
| V | 246 | 17.2% |
| U | 227 | 15.8% |
| N | 135 | 9.4% |
| R | 100 | 7.0% |
| A | 35 | 2.4% |
Space Separator
| Value | Count | Frequency (%) |
| 416 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8743 | 95.5% |
| Common | 416 | 4.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1244 | 14.2% |
| h | 690 | 7.9% |
| d | 681 | 7.8% |
| t | 614 | 7.0% |
| o | 579 | 6.6% |
| a | 579 | 6.6% |
| c | 473 | 5.4% |
| S | 444 | 5.1% |
| m | 444 | 5.1% |
| w | 444 | 5.1% |
| Other values (12) | 2551 | 29.2% |
Common
| Value | Count | Frequency (%) |
| 416 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9159 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 1244 | 13.6% |
| h | 690 | 7.5% |
| d | 681 | 7.4% |
| t | 614 | 6.7% |
| o | 579 | 6.3% |
| a | 579 | 6.3% |
| c | 473 | 5.2% |
| S | 444 | 4.8% |
| m | 444 | 4.8% |
| w | 444 | 4.8% |
| Other values (13) | 2967 | 32.4% |
| Distinct | 5 |
|---|
| Distinct (%) | 0.5% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 8.3 KiB |
|---|
Length
| Max length | 10 |
|---|
| Median length | 9 |
|---|
| Mean length | 8.782319392 |
|---|
| Min length | 8 |
|---|
Characters and Unicode
| Total characters | 9239 |
|---|
| Distinct characters | 23 |
|---|
| Distinct categories | 3 ? |
|---|
| Distinct scripts | 2 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Sample
| 1st row | Somewhat |
|---|
| 2nd row | Somewhat |
|---|
| 3rd row | Very Much |
|---|
| 4th row | Not Really |
|---|
| 5th row | Somewhat |
|---|
| Value | Count | Frequency (%) |
| somewhat | 380 | 25.4% |
| undecided | 289 | 19.3% |
| very | 232 | 15.5% |
| much | 232 | 15.5% |
| not | 151 | 10.1% |
| really | 88 | 5.9% |
| at | 63 | 4.2% |
| all | 63 | 4.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1278 | 13.8% |
| d | 867 | 9.4% |
| h | 612 | 6.6% |
| t | 594 | 6.4% |
| o | 531 | 5.7% |
| a | 531 | 5.7% |
| c | 521 | 5.6% |
| 446 | 4.8% |
| S | 380 | 4.1% |
| w | 380 | 4.1% |
| Other values (13) | 3099 | 33.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7358 | 79.6% |
| Uppercase Letter | 1435 | 15.5% |
| Space Separator | 446 | 4.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1278 | 17.4% |
| d | 867 | 11.8% |
| h | 612 | 8.3% |
| t | 594 | 8.1% |
| o | 531 | 7.2% |
| a | 531 | 7.2% |
| c | 521 | 7.1% |
| w | 380 | 5.2% |
| m | 380 | 5.2% |
| y | 320 | 4.3% |
| Other values (5) | 1344 | 18.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 380 | 26.5% |
| U | 289 | 20.1% |
| V | 232 | 16.2% |
| M | 232 | 16.2% |
| N | 151 | 10.5% |
| R | 88 | 6.1% |
| A | 63 | 4.4% |
Space Separator
| Value | Count | Frequency (%) |
| 446 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8793 | 95.2% |
| Common | 446 | 4.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1278 | 14.5% |
| d | 867 | 9.9% |
| h | 612 | 7.0% |
| t | 594 | 6.8% |
| o | 531 | 6.0% |
| a | 531 | 6.0% |
| c | 521 | 5.9% |
| S | 380 | 4.3% |
| w | 380 | 4.3% |
| m | 380 | 4.3% |
| Other values (12) | 2719 | 30.9% |
Common
| Value | Count | Frequency (%) |
| 446 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9239 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 1278 | 13.8% |
| d | 867 | 9.4% |
| h | 612 | 6.6% |
| t | 594 | 6.4% |
| o | 531 | 5.7% |
| a | 531 | 5.7% |
| c | 521 | 5.6% |
| 446 | 4.8% |
| S | 380 | 4.1% |
| w | 380 | 4.1% |
| Other values (13) | 3099 | 33.5% |
| Distinct | 5 |
|---|
| Distinct (%) | 0.5% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 8.3 KiB |
|---|
Length
| Max length | 10 |
|---|
| Median length | 9 |
|---|
| Mean length | 8.708174905 |
|---|
| Min length | 8 |
|---|
Characters and Unicode
| Total characters | 9161 |
|---|
| Distinct characters | 23 |
|---|
| Distinct categories | 3 ? |
|---|
| Distinct scripts | 2 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Sample
| 1st row | Somewhat |
|---|
| 2nd row | Somewhat |
|---|
| 3rd row | Somewhat |
|---|
| 4th row | Not Really |
|---|
| 5th row | Very Much |
|---|
| Value | Count | Frequency (%) |
| somewhat | 409 | 26.6% |
| very | 358 | 23.3% |
| much | 358 | 23.3% |
| undecided | 183 | 11.9% |
| not | 102 | 6.6% |
| really | 75 | 4.9% |
| at | 27 | 1.8% |
| all | 27 | 1.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1208 | 13.2% |
| h | 767 | 8.4% |
| d | 549 | 6.0% |
| c | 541 | 5.9% |
| t | 538 | 5.9% |
| a | 511 | 5.6% |
| o | 511 | 5.6% |
| 487 | 5.3% |
| y | 433 | 4.7% |
| S | 409 | 4.5% |
| Other values (13) | 3207 | 35.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7162 | 78.2% |
| Uppercase Letter | 1512 | 16.5% |
| Space Separator | 487 | 5.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1208 | 16.9% |
| h | 767 | 10.7% |
| d | 549 | 7.7% |
| c | 541 | 7.6% |
| t | 538 | 7.5% |
| a | 511 | 7.1% |
| o | 511 | 7.1% |
| y | 433 | 6.0% |
| w | 409 | 5.7% |
| m | 409 | 5.7% |
| Other values (5) | 1286 | 18.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 409 | 27.1% |
| V | 358 | 23.7% |
| M | 358 | 23.7% |
| U | 183 | 12.1% |
| N | 102 | 6.7% |
| R | 75 | 5.0% |
| A | 27 | 1.8% |
Space Separator
| Value | Count | Frequency (%) |
| 487 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8674 | 94.7% |
| Common | 487 | 5.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1208 | 13.9% |
| h | 767 | 8.8% |
| d | 549 | 6.3% |
| c | 541 | 6.2% |
| t | 538 | 6.2% |
| a | 511 | 5.9% |
| o | 511 | 5.9% |
| y | 433 | 5.0% |
| S | 409 | 4.7% |
| w | 409 | 4.7% |
| Other values (12) | 2798 | 32.3% |
Common
| Value | Count | Frequency (%) |
| 487 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9161 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 1208 | 13.2% |
| h | 767 | 8.4% |
| d | 549 | 6.0% |
| c | 541 | 5.9% |
| t | 538 | 5.9% |
| a | 511 | 5.6% |
| o | 511 | 5.6% |
| 487 | 5.3% |
| y | 433 | 4.7% |
| S | 409 | 4.5% |
| Other values (13) | 3207 | 35.0% |
| Distinct | 5 |
|---|
| Distinct (%) | 0.5% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 8.3 KiB |
|---|
Length
| Max length | 10 |
|---|
| Median length | 9 |
|---|
| Mean length | 8.772813688 |
|---|
| Min length | 8 |
|---|
Characters and Unicode
| Total characters | 9229 |
|---|
| Distinct characters | 23 |
|---|
| Distinct categories | 3 ? |
|---|
| Distinct scripts | 2 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Sample
| 1st row | Somewhat |
|---|
| 2nd row | Somewhat |
|---|
| 3rd row | Somewhat |
|---|
| 4th row | Somewhat |
|---|
| 5th row | Very Much |
|---|
| Value | Count | Frequency (%) |
| somewhat | 401 | 27.4% |
| undecided | 289 | 19.7% |
| very | 200 | 13.7% |
| much | 200 | 13.7% |
| not | 162 | 11.1% |
| really | 111 | 7.6% |
| at | 51 | 3.5% |
| all | 51 | 3.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1290 | 14.0% |
| d | 867 | 9.4% |
| t | 614 | 6.7% |
| h | 601 | 6.5% |
| o | 563 | 6.1% |
| a | 563 | 6.1% |
| c | 489 | 5.3% |
| 413 | 4.5% |
| S | 401 | 4.3% |
| w | 401 | 4.3% |
| Other values (13) | 3027 | 32.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7402 | 80.2% |
| Uppercase Letter | 1414 | 15.3% |
| Space Separator | 413 | 4.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1290 | 17.4% |
| d | 867 | 11.7% |
| t | 614 | 8.3% |
| h | 601 | 8.1% |
| o | 563 | 7.6% |
| a | 563 | 7.6% |
| c | 489 | 6.6% |
| w | 401 | 5.4% |
| m | 401 | 5.4% |
| l | 324 | 4.4% |
| Other values (5) | 1289 | 17.4% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 401 | 28.4% |
| U | 289 | 20.4% |
| V | 200 | 14.1% |
| M | 200 | 14.1% |
| N | 162 | 11.5% |
| R | 111 | 7.9% |
| A | 51 | 3.6% |
Space Separator
| Value | Count | Frequency (%) |
| 413 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8816 | 95.5% |
| Common | 413 | 4.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1290 | 14.6% |
| d | 867 | 9.8% |
| t | 614 | 7.0% |
| h | 601 | 6.8% |
| o | 563 | 6.4% |
| a | 563 | 6.4% |
| c | 489 | 5.5% |
| S | 401 | 4.5% |
| w | 401 | 4.5% |
| m | 401 | 4.5% |
| Other values (12) | 2626 | 29.8% |
Common
| Value | Count | Frequency (%) |
| 413 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9229 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 1290 | 14.0% |
| d | 867 | 9.4% |
| t | 614 | 6.7% |
| h | 601 | 6.5% |
| o | 563 | 6.1% |
| a | 563 | 6.1% |
| c | 489 | 5.3% |
| 413 | 4.5% |
| S | 401 | 4.3% |
| w | 401 | 4.3% |
| Other values (13) | 3027 | 32.8% |
| Distinct | 5 |
|---|
| Distinct (%) | 0.5% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 8.3 KiB |
|---|
Length
| Max length | 10 |
|---|
| Median length | 9 |
|---|
| Mean length | 8.685361217 |
|---|
| Min length | 8 |
|---|
Characters and Unicode
| Total characters | 9137 |
|---|
| Distinct characters | 23 |
|---|
| Distinct categories | 3 ? |
|---|
| Distinct scripts | 2 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Sample
| 1st row | Somewhat |
|---|
| 2nd row | Somewhat |
|---|
| 3rd row | Somewhat |
|---|
| 4th row | Not Really |
|---|
| 5th row | Somewhat |
|---|
| Value | Count | Frequency (%) |
| somewhat | 444 | 30.7% |
| undecided | 252 | 17.4% |
| very | 243 | 16.8% |
| much | 243 | 16.8% |
| not | 113 | 7.8% |
| really | 75 | 5.2% |
| at | 38 | 2.6% |
| all | 38 | 2.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1266 | 13.9% |
| d | 756 | 8.3% |
| h | 687 | 7.5% |
| t | 595 | 6.5% |
| o | 557 | 6.1% |
| a | 557 | 6.1% |
| c | 495 | 5.4% |
| S | 444 | 4.9% |
| w | 444 | 4.9% |
| m | 444 | 4.9% |
| Other values (13) | 2892 | 31.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7335 | 80.3% |
| Uppercase Letter | 1408 | 15.4% |
| Space Separator | 394 | 4.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1266 | 17.3% |
| d | 756 | 10.3% |
| h | 687 | 9.4% |
| t | 595 | 8.1% |
| o | 557 | 7.6% |
| a | 557 | 7.6% |
| c | 495 | 6.7% |
| w | 444 | 6.1% |
| m | 444 | 6.1% |
| y | 318 | 4.3% |
| Other values (5) | 1216 | 16.6% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 444 | 31.5% |
| U | 252 | 17.9% |
| V | 243 | 17.3% |
| M | 243 | 17.3% |
| N | 113 | 8.0% |
| R | 75 | 5.3% |
| A | 38 | 2.7% |
Space Separator
| Value | Count | Frequency (%) |
| 394 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8743 | 95.7% |
| Common | 394 | 4.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1266 | 14.5% |
| d | 756 | 8.6% |
| h | 687 | 7.9% |
| t | 595 | 6.8% |
| o | 557 | 6.4% |
| a | 557 | 6.4% |
| c | 495 | 5.7% |
| S | 444 | 5.1% |
| w | 444 | 5.1% |
| m | 444 | 5.1% |
| Other values (12) | 2498 | 28.6% |
Common
| Value | Count | Frequency (%) |
| 394 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9137 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 1266 | 13.9% |
| d | 756 | 8.3% |
| h | 687 | 7.5% |
| t | 595 | 6.5% |
| o | 557 | 6.1% |
| a | 557 | 6.1% |
| c | 495 | 5.4% |
| S | 444 | 4.9% |
| w | 444 | 4.9% |
| m | 444 | 4.9% |
| Other values (13) | 2892 | 31.7% |
| Distinct | 5 |
|---|
| Distinct (%) | 0.5% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 8.3 KiB |
|---|
Length
| Max length | 10 |
|---|
| Median length | 9 |
|---|
| Mean length | 8.786121673 |
|---|
| Min length | 8 |
|---|
Characters and Unicode
| Total characters | 9243 |
|---|
| Distinct characters | 23 |
|---|
| Distinct categories | 3 ? |
|---|
| Distinct scripts | 2 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Sample
| 1st row | Somewhat |
|---|
| 2nd row | Somewhat |
|---|
| 3rd row | Somewhat |
|---|
| 4th row | Not Really |
|---|
| 5th row | Very Much |
|---|
| Value | Count | Frequency (%) |
| very | 373 | 23.2% |
| much | 373 | 23.2% |
| somewhat | 359 | 22.3% |
| undecided | 186 | 11.6% |
| not | 134 | 8.3% |
| really | 86 | 5.4% |
| at | 48 | 3.0% |
| all | 48 | 3.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1190 | 12.9% |
| h | 732 | 7.9% |
| c | 559 | 6.0% |
| d | 558 | 6.0% |
| 555 | 6.0% |
| t | 541 | 5.9% |
| a | 493 | 5.3% |
| o | 493 | 5.3% |
| y | 459 | 5.0% |
| V | 373 | 4.0% |
| Other values (13) | 3290 | 35.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7129 | 77.1% |
| Uppercase Letter | 1559 | 16.9% |
| Space Separator | 555 | 6.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1190 | 16.7% |
| h | 732 | 10.3% |
| c | 559 | 7.8% |
| d | 558 | 7.8% |
| t | 541 | 7.6% |
| a | 493 | 6.9% |
| o | 493 | 6.9% |
| y | 459 | 6.4% |
| r | 373 | 5.2% |
| u | 373 | 5.2% |
| Other values (5) | 1358 | 19.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| V | 373 | 23.9% |
| M | 373 | 23.9% |
| S | 359 | 23.0% |
| U | 186 | 11.9% |
| N | 134 | 8.6% |
| R | 86 | 5.5% |
| A | 48 | 3.1% |
Space Separator
| Value | Count | Frequency (%) |
| 555 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8688 | 94.0% |
| Common | 555 | 6.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1190 | 13.7% |
| h | 732 | 8.4% |
| c | 559 | 6.4% |
| d | 558 | 6.4% |
| t | 541 | 6.2% |
| a | 493 | 5.7% |
| o | 493 | 5.7% |
| y | 459 | 5.3% |
| V | 373 | 4.3% |
| r | 373 | 4.3% |
| Other values (12) | 2917 | 33.6% |
Common
| Value | Count | Frequency (%) |
| 555 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9243 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 1190 | 12.9% |
| h | 732 | 7.9% |
| c | 559 | 6.0% |
| d | 558 | 6.0% |
| 555 | 6.0% |
| t | 541 | 5.9% |
| a | 493 | 5.3% |
| o | 493 | 5.3% |
| y | 459 | 5.0% |
| V | 373 | 4.0% |
| Other values (13) | 3290 | 35.6% |
| Distinct | 5 |
|---|
| Distinct (%) | 0.5% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 8.3 KiB |
|---|
Length
| Max length | 10 |
|---|
| Median length | 9 |
|---|
| Mean length | 8.807034221 |
|---|
| Min length | 8 |
|---|
Characters and Unicode
| Total characters | 9265 |
|---|
| Distinct characters | 23 |
|---|
| Distinct categories | 3 ? |
|---|
| Distinct scripts | 2 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Sample
| 1st row | Somewhat |
|---|
| 2nd row | Somewhat |
|---|
| 3rd row | Somewhat |
|---|
| 4th row | Somewhat |
|---|
| 5th row | Somewhat |
|---|
| Value | Count | Frequency (%) |
| somewhat | 382 | 26.0% |
| undecided | 311 | 21.1% |
| very | 180 | 12.2% |
| much | 180 | 12.2% |
| not | 179 | 12.2% |
| really | 119 | 8.1% |
| at | 60 | 4.1% |
| all | 60 | 4.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1303 | 14.1% |
| d | 933 | 10.1% |
| t | 621 | 6.7% |
| h | 562 | 6.1% |
| o | 561 | 6.1% |
| a | 561 | 6.1% |
| c | 491 | 5.3% |
| 419 | 4.5% |
| S | 382 | 4.1% |
| w | 382 | 4.1% |
| Other values (13) | 3050 | 32.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7435 | 80.2% |
| Uppercase Letter | 1411 | 15.2% |
| Space Separator | 419 | 4.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1303 | 17.5% |
| d | 933 | 12.5% |
| t | 621 | 8.4% |
| h | 562 | 7.6% |
| o | 561 | 7.5% |
| a | 561 | 7.5% |
| c | 491 | 6.6% |
| w | 382 | 5.1% |
| m | 382 | 5.1% |
| l | 358 | 4.8% |
| Other values (5) | 1281 | 17.2% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 382 | 27.1% |
| U | 311 | 22.0% |
| V | 180 | 12.8% |
| M | 180 | 12.8% |
| N | 179 | 12.7% |
| R | 119 | 8.4% |
| A | 60 | 4.3% |
Space Separator
| Value | Count | Frequency (%) |
| 419 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8846 | 95.5% |
| Common | 419 | 4.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1303 | 14.7% |
| d | 933 | 10.5% |
| t | 621 | 7.0% |
| h | 562 | 6.4% |
| o | 561 | 6.3% |
| a | 561 | 6.3% |
| c | 491 | 5.6% |
| S | 382 | 4.3% |
| w | 382 | 4.3% |
| m | 382 | 4.3% |
| Other values (12) | 2668 | 30.2% |
Common
| Value | Count | Frequency (%) |
| 419 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9265 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 1303 | 14.1% |
| d | 933 | 10.1% |
| t | 621 | 6.7% |
| h | 562 | 6.1% |
| o | 561 | 6.1% |
| a | 561 | 6.1% |
| c | 491 | 5.3% |
| 419 | 4.5% |
| S | 382 | 4.1% |
| w | 382 | 4.1% |
| Other values (13) | 3050 | 32.9% |
| Distinct | 5 |
|---|
| Distinct (%) | 0.5% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 8.3 KiB |
|---|
Length
| Max length | 10 |
|---|
| Median length | 9 |
|---|
| Mean length | 8.679657795 |
|---|
| Min length | 8 |
|---|
Characters and Unicode
| Total characters | 9131 |
|---|
| Distinct characters | 23 |
|---|
| Distinct categories | 3 ? |
|---|
| Distinct scripts | 2 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Sample
| 1st row | Somewhat |
|---|
| 2nd row | Somewhat |
|---|
| 3rd row | Somewhat |
|---|
| 4th row | Somewhat |
|---|
| 5th row | Somewhat |
|---|
| Value | Count | Frequency (%) |
| very | 423 | 27.1% |
| much | 423 | 27.1% |
| somewhat | 404 | 25.9% |
| undecided | 158 | 10.1% |
| not | 67 | 4.3% |
| really | 50 | 3.2% |
| at | 17 | 1.1% |
| all | 17 | 1.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1193 | 13.1% |
| h | 827 | 9.1% |
| c | 581 | 6.4% |
| 507 | 5.6% |
| t | 488 | 5.3% |
| d | 474 | 5.2% |
| y | 473 | 5.2% |
| a | 471 | 5.2% |
| o | 471 | 5.2% |
| V | 423 | 4.6% |
| Other values (13) | 3223 | 35.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7082 | 77.6% |
| Uppercase Letter | 1542 | 16.9% |
| Space Separator | 507 | 5.6% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1193 | 16.8% |
| h | 827 | 11.7% |
| c | 581 | 8.2% |
| t | 488 | 6.9% |
| d | 474 | 6.7% |
| y | 473 | 6.7% |
| a | 471 | 6.7% |
| o | 471 | 6.7% |
| r | 423 | 6.0% |
| u | 423 | 6.0% |
| Other values (5) | 1258 | 17.8% |
Uppercase Letter
| Value | Count | Frequency (%) |
| V | 423 | 27.4% |
| M | 423 | 27.4% |
| S | 404 | 26.2% |
| U | 158 | 10.2% |
| N | 67 | 4.3% |
| R | 50 | 3.2% |
| A | 17 | 1.1% |
Space Separator
| Value | Count | Frequency (%) |
| 507 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8624 | 94.4% |
| Common | 507 | 5.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1193 | 13.8% |
| h | 827 | 9.6% |
| c | 581 | 6.7% |
| t | 488 | 5.7% |
| d | 474 | 5.5% |
| y | 473 | 5.5% |
| a | 471 | 5.5% |
| o | 471 | 5.5% |
| V | 423 | 4.9% |
| r | 423 | 4.9% |
| Other values (12) | 2800 | 32.5% |
Common
| Value | Count | Frequency (%) |
| 507 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9131 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 1193 | 13.1% |
| h | 827 | 9.1% |
| c | 581 | 6.4% |
| 507 | 5.6% |
| t | 488 | 5.3% |
| d | 474 | 5.2% |
| y | 473 | 5.2% |
| a | 471 | 5.2% |
| o | 471 | 5.2% |
| V | 423 | 4.6% |
| Other values (13) | 3223 | 35.3% |
| Distinct | 5 |
|---|
| Distinct (%) | 0.5% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 8.3 KiB |
|---|
Length
| Max length | 10 |
|---|
| Median length | 9 |
|---|
| Mean length | 8.706273764 |
|---|
| Min length | 8 |
|---|
Characters and Unicode
| Total characters | 9159 |
|---|
| Distinct characters | 23 |
|---|
| Distinct categories | 3 ? |
|---|
| Distinct scripts | 2 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Sample
| 1st row | Somewhat |
|---|
| 2nd row | Somewhat |
|---|
| 3rd row | Somewhat |
|---|
| 4th row | Not Really |
|---|
| 5th row | Somewhat |
|---|
| Value | Count | Frequency (%) |
| somewhat | 437 | 30.2% |
| undecided | 258 | 17.9% |
| very | 229 | 15.8% |
| much | 229 | 15.8% |
| not | 128 | 8.9% |
| really | 92 | 6.4% |
| at | 36 | 2.5% |
| all | 36 | 2.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1274 | 13.9% |
| d | 774 | 8.5% |
| h | 666 | 7.3% |
| t | 601 | 6.6% |
| o | 565 | 6.2% |
| a | 565 | 6.2% |
| c | 487 | 5.3% |
| S | 437 | 4.8% |
| w | 437 | 4.8% |
| m | 437 | 4.8% |
| Other values (13) | 2916 | 31.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7357 | 80.3% |
| Uppercase Letter | 1409 | 15.4% |
| Space Separator | 393 | 4.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1274 | 17.3% |
| d | 774 | 10.5% |
| h | 666 | 9.1% |
| t | 601 | 8.2% |
| o | 565 | 7.7% |
| a | 565 | 7.7% |
| c | 487 | 6.6% |
| w | 437 | 5.9% |
| m | 437 | 5.9% |
| y | 321 | 4.4% |
| Other values (5) | 1230 | 16.7% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 437 | 31.0% |
| U | 258 | 18.3% |
| V | 229 | 16.3% |
| M | 229 | 16.3% |
| N | 128 | 9.1% |
| R | 92 | 6.5% |
| A | 36 | 2.6% |
Space Separator
| Value | Count | Frequency (%) |
| 393 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8766 | 95.7% |
| Common | 393 | 4.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1274 | 14.5% |
| d | 774 | 8.8% |
| h | 666 | 7.6% |
| t | 601 | 6.9% |
| o | 565 | 6.4% |
| a | 565 | 6.4% |
| c | 487 | 5.6% |
| S | 437 | 5.0% |
| w | 437 | 5.0% |
| m | 437 | 5.0% |
| Other values (12) | 2523 | 28.8% |
Common
| Value | Count | Frequency (%) |
| 393 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9159 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 1274 | 13.9% |
| d | 774 | 8.5% |
| h | 666 | 7.3% |
| t | 601 | 6.6% |
| o | 565 | 6.2% |
| a | 565 | 6.2% |
| c | 487 | 5.3% |
| S | 437 | 4.8% |
| w | 437 | 4.8% |
| m | 437 | 4.8% |
| Other values (13) | 2916 | 31.8% |
| Distinct | 5 |
|---|
| Distinct (%) | 0.5% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 8.3 KiB |
|---|
Length
| Max length | 10 |
|---|
| Median length | 9 |
|---|
| Mean length | 8.705323194 |
|---|
| Min length | 8 |
|---|
Characters and Unicode
| Total characters | 9158 |
|---|
| Distinct characters | 23 |
|---|
| Distinct categories | 3 ? |
|---|
| Distinct scripts | 2 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Sample
| 1st row | Somewhat |
|---|
| 2nd row | Somewhat |
|---|
| 3rd row | Undecided |
|---|
| 4th row | Not Really |
|---|
| 5th row | Very Much |
|---|
| Value | Count | Frequency (%) |
| somewhat | 420 | 28.1% |
| very | 302 | 20.2% |
| much | 302 | 20.2% |
| undecided | 220 | 14.7% |
| not | 110 | 7.4% |
| really | 80 | 5.4% |
| at | 30 | 2.0% |
| all | 30 | 2.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1242 | 13.6% |
| h | 722 | 7.9% |
| d | 660 | 7.2% |
| t | 560 | 6.1% |
| o | 530 | 5.8% |
| a | 530 | 5.8% |
| c | 522 | 5.7% |
| 442 | 4.8% |
| S | 420 | 4.6% |
| w | 420 | 4.6% |
| Other values (13) | 3110 | 34.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7252 | 79.2% |
| Uppercase Letter | 1464 | 16.0% |
| Space Separator | 442 | 4.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1242 | 17.1% |
| h | 722 | 10.0% |
| d | 660 | 9.1% |
| t | 560 | 7.7% |
| o | 530 | 7.3% |
| a | 530 | 7.3% |
| c | 522 | 7.2% |
| w | 420 | 5.8% |
| m | 420 | 5.8% |
| y | 382 | 5.3% |
| Other values (5) | 1264 | 17.4% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 420 | 28.7% |
| V | 302 | 20.6% |
| M | 302 | 20.6% |
| U | 220 | 15.0% |
| N | 110 | 7.5% |
| R | 80 | 5.5% |
| A | 30 | 2.0% |
Space Separator
| Value | Count | Frequency (%) |
| 442 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8716 | 95.2% |
| Common | 442 | 4.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1242 | 14.2% |
| h | 722 | 8.3% |
| d | 660 | 7.6% |
| t | 560 | 6.4% |
| o | 530 | 6.1% |
| a | 530 | 6.1% |
| c | 522 | 6.0% |
| S | 420 | 4.8% |
| w | 420 | 4.8% |
| m | 420 | 4.8% |
| Other values (12) | 2690 | 30.9% |
Common
| Value | Count | Frequency (%) |
| 442 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9158 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 1242 | 13.6% |
| h | 722 | 7.9% |
| d | 660 | 7.2% |
| t | 560 | 6.1% |
| o | 530 | 5.8% |
| a | 530 | 5.8% |
| c | 522 | 5.7% |
| 442 | 4.8% |
| S | 420 | 4.6% |
| w | 420 | 4.6% |
| Other values (13) | 3110 | 34.0% |
| Distinct | 5 |
|---|
| Distinct (%) | 0.5% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 8.3 KiB |
|---|
Length
| Max length | 10 |
|---|
| Median length | 9 |
|---|
| Mean length | 8.66634981 |
|---|
| Min length | 8 |
|---|
Characters and Unicode
| Total characters | 9117 |
|---|
| Distinct characters | 23 |
|---|
| Distinct categories | 3 ? |
|---|
| Distinct scripts | 2 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Sample
| 1st row | Somewhat |
|---|
| 2nd row | Somewhat |
|---|
| 3rd row | Somewhat |
|---|
| 4th row | Somewhat |
|---|
| 5th row | Very Much |
|---|
| Value | Count | Frequency (%) |
| somewhat | 442 | 30.0% |
| very | 305 | 20.7% |
| much | 305 | 20.7% |
| undecided | 214 | 14.5% |
| not | 91 | 6.2% |
| really | 65 | 4.4% |
| at | 26 | 1.8% |
| all | 26 | 1.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1240 | 13.6% |
| h | 747 | 8.2% |
| d | 642 | 7.0% |
| t | 559 | 6.1% |
| o | 533 | 5.8% |
| a | 533 | 5.8% |
| c | 519 | 5.7% |
| S | 442 | 4.8% |
| m | 442 | 4.8% |
| w | 442 | 4.8% |
| Other values (13) | 3018 | 33.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7247 | 79.5% |
| Uppercase Letter | 1448 | 15.9% |
| Space Separator | 422 | 4.6% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1240 | 17.1% |
| h | 747 | 10.3% |
| d | 642 | 8.9% |
| t | 559 | 7.7% |
| o | 533 | 7.4% |
| a | 533 | 7.4% |
| c | 519 | 7.2% |
| m | 442 | 6.1% |
| w | 442 | 6.1% |
| y | 370 | 5.1% |
| Other values (5) | 1220 | 16.8% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 442 | 30.5% |
| M | 305 | 21.1% |
| V | 305 | 21.1% |
| U | 214 | 14.8% |
| N | 91 | 6.3% |
| R | 65 | 4.5% |
| A | 26 | 1.8% |
Space Separator
| Value | Count | Frequency (%) |
| 422 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8695 | 95.4% |
| Common | 422 | 4.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1240 | 14.3% |
| h | 747 | 8.6% |
| d | 642 | 7.4% |
| t | 559 | 6.4% |
| o | 533 | 6.1% |
| a | 533 | 6.1% |
| c | 519 | 6.0% |
| S | 442 | 5.1% |
| m | 442 | 5.1% |
| w | 442 | 5.1% |
| Other values (12) | 2596 | 29.9% |
Common
| Value | Count | Frequency (%) |
| 422 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9117 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 1240 | 13.6% |
| h | 747 | 8.2% |
| d | 642 | 7.0% |
| t | 559 | 6.1% |
| o | 533 | 5.8% |
| a | 533 | 5.8% |
| c | 519 | 5.7% |
| S | 442 | 4.8% |
| m | 442 | 4.8% |
| w | 442 | 4.8% |
| Other values (13) | 3018 | 33.1% |
| Distinct | 5 |
|---|
| Distinct (%) | 0.5% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 8.3 KiB |
|---|
Length
| Max length | 10 |
|---|
| Median length | 9 |
|---|
| Mean length | 8.903041825 |
|---|
| Min length | 8 |
|---|
Characters and Unicode
| Total characters | 9366 |
|---|
| Distinct characters | 23 |
|---|
| Distinct categories | 3 ? |
|---|
| Distinct scripts | 2 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Sample
| 1st row | Somewhat |
|---|
| 2nd row | Somewhat |
|---|
| 3rd row | Undecided |
|---|
| 4th row | Not Really |
|---|
| 5th row | Very Much |
|---|
| Value | Count | Frequency (%) |
| undecided | 334 | 21.9% |
| somewhat | 325 | 21.3% |
| not | 223 | 14.6% |
| very | 170 | 11.2% |
| much | 170 | 11.2% |
| really | 144 | 9.4% |
| at | 79 | 5.2% |
| all | 79 | 5.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1307 | 14.0% |
| d | 1002 | 10.7% |
| t | 627 | 6.7% |
| a | 548 | 5.9% |
| o | 548 | 5.9% |
| c | 504 | 5.4% |
| h | 495 | 5.3% |
| 472 | 5.0% |
| l | 446 | 4.8% |
| n | 334 | 3.6% |
| Other values (13) | 3083 | 32.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7449 | 79.5% |
| Uppercase Letter | 1445 | 15.4% |
| Space Separator | 472 | 5.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1307 | 17.5% |
| d | 1002 | 13.5% |
| t | 627 | 8.4% |
| a | 548 | 7.4% |
| o | 548 | 7.4% |
| c | 504 | 6.8% |
| h | 495 | 6.6% |
| l | 446 | 6.0% |
| n | 334 | 4.5% |
| i | 334 | 4.5% |
| Other values (5) | 1304 | 17.5% |
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 334 | 23.1% |
| S | 325 | 22.5% |
| N | 223 | 15.4% |
| V | 170 | 11.8% |
| M | 170 | 11.8% |
| R | 144 | 10.0% |
| A | 79 | 5.5% |
Space Separator
| Value | Count | Frequency (%) |
| 472 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8894 | 95.0% |
| Common | 472 | 5.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1307 | 14.7% |
| d | 1002 | 11.3% |
| t | 627 | 7.0% |
| a | 548 | 6.2% |
| o | 548 | 6.2% |
| c | 504 | 5.7% |
| h | 495 | 5.6% |
| l | 446 | 5.0% |
| n | 334 | 3.8% |
| U | 334 | 3.8% |
| Other values (12) | 2749 | 30.9% |
Common
| Value | Count | Frequency (%) |
| 472 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9366 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 1307 | 14.0% |
| d | 1002 | 10.7% |
| t | 627 | 6.7% |
| a | 548 | 5.9% |
| o | 548 | 5.9% |
| c | 504 | 5.4% |
| h | 495 | 5.3% |
| 472 | 5.0% |
| l | 446 | 4.8% |
| n | 334 | 3.6% |
| Other values (13) | 3083 | 32.9% |
| Distinct | 5 |
|---|
| Distinct (%) | 0.5% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 8.3 KiB |
|---|
Length
| Max length | 10 |
|---|
| Median length | 9 |
|---|
| Mean length | 8.94581749 |
|---|
| Min length | 8 |
|---|
Characters and Unicode
| Total characters | 9411 |
|---|
| Distinct characters | 23 |
|---|
| Distinct categories | 3 ? |
|---|
| Distinct scripts | 2 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Sample
| 1st row | Somewhat |
|---|
| 2nd row | Somewhat |
|---|
| 3rd row | Undecided |
|---|
| 4th row | Not Really |
|---|
| 5th row | Somewhat |
|---|
| Value | Count | Frequency (%) |
| undecided | 355 | 23.4% |
| somewhat | 315 | 20.7% |
| not | 258 | 17.0% |
| really | 172 | 11.3% |
| very | 124 | 8.2% |
| much | 124 | 8.2% |
| at | 86 | 5.7% |
| all | 86 | 5.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1321 | 14.0% |
| d | 1065 | 11.3% |
| t | 659 | 7.0% |
| a | 573 | 6.1% |
| o | 573 | 6.1% |
| l | 516 | 5.5% |
| c | 479 | 5.1% |
| 468 | 5.0% |
| h | 439 | 4.7% |
| n | 355 | 3.8% |
| Other values (13) | 2963 | 31.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7509 | 79.8% |
| Uppercase Letter | 1434 | 15.2% |
| Space Separator | 468 | 5.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1321 | 17.6% |
| d | 1065 | 14.2% |
| t | 659 | 8.8% |
| a | 573 | 7.6% |
| o | 573 | 7.6% |
| l | 516 | 6.9% |
| c | 479 | 6.4% |
| h | 439 | 5.8% |
| n | 355 | 4.7% |
| i | 355 | 4.7% |
| Other values (5) | 1174 | 15.6% |
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 355 | 24.8% |
| S | 315 | 22.0% |
| N | 258 | 18.0% |
| R | 172 | 12.0% |
| V | 124 | 8.6% |
| M | 124 | 8.6% |
| A | 86 | 6.0% |
Space Separator
| Value | Count | Frequency (%) |
| 468 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8943 | 95.0% |
| Common | 468 | 5.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1321 | 14.8% |
| d | 1065 | 11.9% |
| t | 659 | 7.4% |
| a | 573 | 6.4% |
| o | 573 | 6.4% |
| l | 516 | 5.8% |
| c | 479 | 5.4% |
| h | 439 | 4.9% |
| n | 355 | 4.0% |
| U | 355 | 4.0% |
| Other values (12) | 2608 | 29.2% |
Common
| Value | Count | Frequency (%) |
| 468 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9411 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 1321 | 14.0% |
| d | 1065 | 11.3% |
| t | 659 | 7.0% |
| a | 573 | 6.1% |
| o | 573 | 6.1% |
| l | 516 | 5.5% |
| c | 479 | 5.1% |
| 468 | 5.0% |
| h | 439 | 4.7% |
| n | 355 | 3.8% |
| Other values (13) | 2963 | 31.5% |
| Distinct | 5 |
|---|
| Distinct (%) | 0.5% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 8.3 KiB |
|---|
Length
| Max length | 10 |
|---|
| Median length | 9.5 |
|---|
| Mean length | 8.66730038 |
|---|
| Min length | 8 |
|---|
Characters and Unicode
| Total characters | 9118 |
|---|
| Distinct characters | 23 |
|---|
| Distinct categories | 3 ? |
|---|
| Distinct scripts | 2 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Sample
| 1st row | Somewhat |
|---|
| 2nd row | Somewhat |
|---|
| 3rd row | Undecided |
|---|
| 4th row | Not Really |
|---|
| 5th row | Somewhat |
|---|
| Value | Count | Frequency (%) |
| somewhat | 438 | 30.4% |
| very | 281 | 19.5% |
| much | 281 | 19.5% |
| undecided | 245 | 17.0% |
| not | 88 | 6.1% |
| really | 66 | 4.6% |
| at | 22 | 1.5% |
| all | 22 | 1.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1275 | 14.0% |
| d | 735 | 8.1% |
| h | 719 | 7.9% |
| t | 548 | 6.0% |
| o | 526 | 5.8% |
| a | 526 | 5.8% |
| c | 526 | 5.8% |
| S | 438 | 4.8% |
| m | 438 | 4.8% |
| w | 438 | 4.8% |
| Other values (13) | 2949 | 32.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7306 | 80.1% |
| Uppercase Letter | 1421 | 15.6% |
| Space Separator | 391 | 4.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1275 | 17.5% |
| d | 735 | 10.1% |
| h | 719 | 9.8% |
| t | 548 | 7.5% |
| o | 526 | 7.2% |
| a | 526 | 7.2% |
| c | 526 | 7.2% |
| m | 438 | 6.0% |
| w | 438 | 6.0% |
| y | 347 | 4.7% |
| Other values (5) | 1228 | 16.8% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 438 | 30.8% |
| M | 281 | 19.8% |
| V | 281 | 19.8% |
| U | 245 | 17.2% |
| N | 88 | 6.2% |
| R | 66 | 4.6% |
| A | 22 | 1.5% |
Space Separator
| Value | Count | Frequency (%) |
| 391 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8727 | 95.7% |
| Common | 391 | 4.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1275 | 14.6% |
| d | 735 | 8.4% |
| h | 719 | 8.2% |
| t | 548 | 6.3% |
| o | 526 | 6.0% |
| a | 526 | 6.0% |
| c | 526 | 6.0% |
| S | 438 | 5.0% |
| m | 438 | 5.0% |
| w | 438 | 5.0% |
| Other values (12) | 2558 | 29.3% |
Common
| Value | Count | Frequency (%) |
| 391 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9118 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 1275 | 14.0% |
| d | 735 | 8.1% |
| h | 719 | 7.9% |
| t | 548 | 6.0% |
| o | 526 | 5.8% |
| a | 526 | 5.8% |
| c | 526 | 5.8% |
| S | 438 | 4.8% |
| m | 438 | 4.8% |
| w | 438 | 4.8% |
| Other values (13) | 2949 | 32.3% |
| Distinct | 5 |
|---|
| Distinct (%) | 0.5% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 8.3 KiB |
|---|
Length
| Max length | 10 |
|---|
| Median length | 9 |
|---|
| Mean length | 8.6378327 |
|---|
| Min length | 8 |
|---|
Characters and Unicode
| Total characters | 9087 |
|---|
| Distinct characters | 23 |
|---|
| Distinct categories | 3 ? |
|---|
| Distinct scripts | 2 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Sample
| 1st row | Somewhat |
|---|
| 2nd row | Somewhat |
|---|
| 3rd row | Very Much |
|---|
| 4th row | Somewhat |
|---|
| 5th row | Somewhat |
|---|
| Value | Count | Frequency (%) |
| somewhat | 465 | 30.8% |
| very | 357 | 23.7% |
| much | 357 | 23.7% |
| undecided | 146 | 9.7% |
| not | 84 | 5.6% |
| really | 68 | 4.5% |
| at | 16 | 1.1% |
| all | 16 | 1.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1182 | 13.0% |
| h | 822 | 9.0% |
| t | 565 | 6.2% |
| o | 549 | 6.0% |
| a | 549 | 6.0% |
| c | 503 | 5.5% |
| S | 465 | 5.1% |
| m | 465 | 5.1% |
| w | 465 | 5.1% |
| 457 | 5.0% |
| Other values (13) | 3065 | 33.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7137 | 78.5% |
| Uppercase Letter | 1493 | 16.4% |
| Space Separator | 457 | 5.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1182 | 16.6% |
| h | 822 | 11.5% |
| t | 565 | 7.9% |
| o | 549 | 7.7% |
| a | 549 | 7.7% |
| c | 503 | 7.0% |
| m | 465 | 6.5% |
| w | 465 | 6.5% |
| d | 438 | 6.1% |
| y | 425 | 6.0% |
| Other values (5) | 1174 | 16.4% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 465 | 31.1% |
| M | 357 | 23.9% |
| V | 357 | 23.9% |
| U | 146 | 9.8% |
| N | 84 | 5.6% |
| R | 68 | 4.6% |
| A | 16 | 1.1% |
Space Separator
| Value | Count | Frequency (%) |
| 457 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8630 | 95.0% |
| Common | 457 | 5.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1182 | 13.7% |
| h | 822 | 9.5% |
| t | 565 | 6.5% |
| o | 549 | 6.4% |
| a | 549 | 6.4% |
| c | 503 | 5.8% |
| S | 465 | 5.4% |
| m | 465 | 5.4% |
| w | 465 | 5.4% |
| d | 438 | 5.1% |
| Other values (12) | 2627 | 30.4% |
Common
| Value | Count | Frequency (%) |
| 457 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9087 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 1182 | 13.0% |
| h | 822 | 9.0% |
| t | 565 | 6.2% |
| o | 549 | 6.0% |
| a | 549 | 6.0% |
| c | 503 | 5.5% |
| S | 465 | 5.1% |
| m | 465 | 5.1% |
| w | 465 | 5.1% |
| 457 | 5.0% |
| Other values (13) | 3065 | 33.7% |
| Distinct | 5 |
|---|
| Distinct (%) | 0.5% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 8.3 KiB |
|---|
Length
| Max length | 10 |
|---|
| Median length | 9 |
|---|
| Mean length | 8.72243346 |
|---|
| Min length | 8 |
|---|
Characters and Unicode
| Total characters | 9176 |
|---|
| Distinct characters | 23 |
|---|
| Distinct categories | 3 ? |
|---|
| Distinct scripts | 2 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Sample
| 1st row | Somewhat |
|---|
| 2nd row | Somewhat |
|---|
| 3rd row | Very Much |
|---|
| 4th row | Undecided |
|---|
| 5th row | Somewhat |
|---|
| Value | Count | Frequency (%) |
| somewhat | 423 | 29.5% |
| undecided | 286 | 20.0% |
| very | 212 | 14.8% |
| much | 212 | 14.8% |
| not | 131 | 9.1% |
| really | 94 | 6.6% |
| at | 37 | 2.6% |
| all | 37 | 2.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1301 | 14.2% |
| d | 858 | 9.4% |
| h | 635 | 6.9% |
| t | 591 | 6.4% |
| o | 554 | 6.0% |
| a | 554 | 6.0% |
| c | 498 | 5.4% |
| S | 423 | 4.6% |
| w | 423 | 4.6% |
| m | 423 | 4.6% |
| Other values (13) | 2916 | 31.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7401 | 80.7% |
| Uppercase Letter | 1395 | 15.2% |
| Space Separator | 380 | 4.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1301 | 17.6% |
| d | 858 | 11.6% |
| h | 635 | 8.6% |
| t | 591 | 8.0% |
| o | 554 | 7.5% |
| a | 554 | 7.5% |
| c | 498 | 6.7% |
| w | 423 | 5.7% |
| m | 423 | 5.7% |
| y | 306 | 4.1% |
| Other values (5) | 1258 | 17.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 423 | 30.3% |
| U | 286 | 20.5% |
| V | 212 | 15.2% |
| M | 212 | 15.2% |
| N | 131 | 9.4% |
| R | 94 | 6.7% |
| A | 37 | 2.7% |
Space Separator
| Value | Count | Frequency (%) |
| 380 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8796 | 95.9% |
| Common | 380 | 4.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1301 | 14.8% |
| d | 858 | 9.8% |
| h | 635 | 7.2% |
| t | 591 | 6.7% |
| o | 554 | 6.3% |
| a | 554 | 6.3% |
| c | 498 | 5.7% |
| S | 423 | 4.8% |
| w | 423 | 4.8% |
| m | 423 | 4.8% |
| Other values (12) | 2536 | 28.8% |
Common
| Value | Count | Frequency (%) |
| 380 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9176 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 1301 | 14.2% |
| d | 858 | 9.4% |
| h | 635 | 6.9% |
| t | 591 | 6.4% |
| o | 554 | 6.0% |
| a | 554 | 6.0% |
| c | 498 | 5.4% |
| S | 423 | 4.6% |
| w | 423 | 4.6% |
| m | 423 | 4.6% |
| Other values (13) | 2916 | 31.8% |
| Distinct | 5 |
|---|
| Distinct (%) | 0.5% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 8.3 KiB |
|---|
Length
| Max length | 10 |
|---|
| Median length | 9 |
|---|
| Mean length | 8.760456274 |
|---|
| Min length | 8 |
|---|
Characters and Unicode
| Total characters | 9216 |
|---|
| Distinct characters | 23 |
|---|
| Distinct categories | 3 ? |
|---|
| Distinct scripts | 2 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Sample
| 1st row | Somewhat |
|---|
| 2nd row | Somewhat |
|---|
| 3rd row | Somewhat |
|---|
| 4th row | Undecided |
|---|
| 5th row | Somewhat |
|---|
| Value | Count | Frequency (%) |
| somewhat | 383 | 24.8% |
| very | 318 | 20.6% |
| much | 318 | 20.6% |
| undecided | 220 | 14.3% |
| not | 131 | 8.5% |
| really | 90 | 5.8% |
| at | 41 | 2.7% |
| all | 41 | 2.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1231 | 13.4% |
| h | 701 | 7.6% |
| d | 660 | 7.2% |
| t | 555 | 6.0% |
| c | 538 | 5.8% |
| a | 514 | 5.6% |
| o | 514 | 5.6% |
| 490 | 5.3% |
| y | 408 | 4.4% |
| S | 383 | 4.2% |
| Other values (13) | 3222 | 35.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7225 | 78.4% |
| Uppercase Letter | 1501 | 16.3% |
| Space Separator | 490 | 5.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1231 | 17.0% |
| h | 701 | 9.7% |
| d | 660 | 9.1% |
| t | 555 | 7.7% |
| c | 538 | 7.4% |
| a | 514 | 7.1% |
| o | 514 | 7.1% |
| y | 408 | 5.6% |
| w | 383 | 5.3% |
| m | 383 | 5.3% |
| Other values (5) | 1338 | 18.5% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 383 | 25.5% |
| V | 318 | 21.2% |
| M | 318 | 21.2% |
| U | 220 | 14.7% |
| N | 131 | 8.7% |
| R | 90 | 6.0% |
| A | 41 | 2.7% |
Space Separator
| Value | Count | Frequency (%) |
| 490 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8726 | 94.7% |
| Common | 490 | 5.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1231 | 14.1% |
| h | 701 | 8.0% |
| d | 660 | 7.6% |
| t | 555 | 6.4% |
| c | 538 | 6.2% |
| a | 514 | 5.9% |
| o | 514 | 5.9% |
| y | 408 | 4.7% |
| S | 383 | 4.4% |
| w | 383 | 4.4% |
| Other values (12) | 2839 | 32.5% |
Common
| Value | Count | Frequency (%) |
| 490 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9216 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 1231 | 13.4% |
| h | 701 | 7.6% |
| d | 660 | 7.2% |
| t | 555 | 6.0% |
| c | 538 | 5.8% |
| a | 514 | 5.6% |
| o | 514 | 5.6% |
| 490 | 5.3% |
| y | 408 | 4.4% |
| S | 383 | 4.2% |
| Other values (13) | 3222 | 35.0% |
| Distinct | 5 |
|---|
| Distinct (%) | 0.5% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 8.3 KiB |
|---|
Length
| Max length | 10 |
|---|
| Median length | 9 |
|---|
| Mean length | 8.72338403 |
|---|
| Min length | 8 |
|---|
Characters and Unicode
| Total characters | 9177 |
|---|
| Distinct characters | 23 |
|---|
| Distinct categories | 3 ? |
|---|
| Distinct scripts | 2 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Sample
| 1st row | Somewhat |
|---|
| 2nd row | Somewhat |
|---|
| 3rd row | Somewhat |
|---|
| 4th row | Not Really |
|---|
| 5th row | Very Much |
|---|
| Value | Count | Frequency (%) |
| somewhat | 423 | 29.4% |
| undecided | 284 | 19.8% |
| very | 213 | 14.8% |
| much | 213 | 14.8% |
| not | 132 | 9.2% |
| really | 92 | 6.4% |
| at | 40 | 2.8% |
| all | 40 | 2.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1296 | 14.1% |
| d | 852 | 9.3% |
| h | 636 | 6.9% |
| t | 595 | 6.5% |
| o | 555 | 6.0% |
| a | 555 | 6.0% |
| c | 497 | 5.4% |
| S | 423 | 4.6% |
| w | 423 | 4.6% |
| m | 423 | 4.6% |
| Other values (13) | 2922 | 31.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7395 | 80.6% |
| Uppercase Letter | 1397 | 15.2% |
| Space Separator | 385 | 4.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1296 | 17.5% |
| d | 852 | 11.5% |
| h | 636 | 8.6% |
| t | 595 | 8.0% |
| o | 555 | 7.5% |
| a | 555 | 7.5% |
| c | 497 | 6.7% |
| w | 423 | 5.7% |
| m | 423 | 5.7% |
| y | 305 | 4.1% |
| Other values (5) | 1258 | 17.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 423 | 30.3% |
| U | 284 | 20.3% |
| V | 213 | 15.2% |
| M | 213 | 15.2% |
| N | 132 | 9.4% |
| R | 92 | 6.6% |
| A | 40 | 2.9% |
Space Separator
| Value | Count | Frequency (%) |
| 385 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8792 | 95.8% |
| Common | 385 | 4.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1296 | 14.7% |
| d | 852 | 9.7% |
| h | 636 | 7.2% |
| t | 595 | 6.8% |
| o | 555 | 6.3% |
| a | 555 | 6.3% |
| c | 497 | 5.7% |
| S | 423 | 4.8% |
| w | 423 | 4.8% |
| m | 423 | 4.8% |
| Other values (12) | 2537 | 28.9% |
Common
| Value | Count | Frequency (%) |
| 385 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9177 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 1296 | 14.1% |
| d | 852 | 9.3% |
| h | 636 | 6.9% |
| t | 595 | 6.5% |
| o | 555 | 6.0% |
| a | 555 | 6.0% |
| c | 497 | 5.4% |
| S | 423 | 4.6% |
| w | 423 | 4.6% |
| m | 423 | 4.6% |
| Other values (13) | 2922 | 31.8% |
| Distinct | 5 |
|---|
| Distinct (%) | 0.5% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 8.3 KiB |
|---|
Length
| Max length | 10 |
|---|
| Median length | 9 |
|---|
| Mean length | 8.66634981 |
|---|
| Min length | 8 |
|---|
Characters and Unicode
| Total characters | 9117 |
|---|
| Distinct characters | 23 |
|---|
| Distinct categories | 3 ? |
|---|
| Distinct scripts | 2 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Sample
| 1st row | Somewhat |
|---|
| 2nd row | Somewhat |
|---|
| 3rd row | Somewhat |
|---|
| 4th row | Undecided |
|---|
| 5th row | Very Much |
|---|
| Value | Count | Frequency (%) |
| somewhat | 454 | 31.6% |
| very | 254 | 17.7% |
| much | 254 | 17.7% |
| undecided | 241 | 16.8% |
| not | 103 | 7.2% |
| really | 77 | 5.4% |
| at | 26 | 1.8% |
| all | 26 | 1.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1267 | 13.9% |
| d | 723 | 7.9% |
| h | 708 | 7.8% |
| t | 583 | 6.4% |
| o | 557 | 6.1% |
| a | 557 | 6.1% |
| c | 495 | 5.4% |
| S | 454 | 5.0% |
| m | 454 | 5.0% |
| w | 454 | 5.0% |
| Other values (13) | 2865 | 31.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7325 | 80.3% |
| Uppercase Letter | 1409 | 15.5% |
| Space Separator | 383 | 4.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1267 | 17.3% |
| d | 723 | 9.9% |
| h | 708 | 9.7% |
| t | 583 | 8.0% |
| o | 557 | 7.6% |
| a | 557 | 7.6% |
| c | 495 | 6.8% |
| m | 454 | 6.2% |
| w | 454 | 6.2% |
| y | 331 | 4.5% |
| Other values (5) | 1196 | 16.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 454 | 32.2% |
| M | 254 | 18.0% |
| V | 254 | 18.0% |
| U | 241 | 17.1% |
| N | 103 | 7.3% |
| R | 77 | 5.5% |
| A | 26 | 1.8% |
Space Separator
| Value | Count | Frequency (%) |
| 383 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8734 | 95.8% |
| Common | 383 | 4.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1267 | 14.5% |
| d | 723 | 8.3% |
| h | 708 | 8.1% |
| t | 583 | 6.7% |
| o | 557 | 6.4% |
| a | 557 | 6.4% |
| c | 495 | 5.7% |
| S | 454 | 5.2% |
| m | 454 | 5.2% |
| w | 454 | 5.2% |
| Other values (12) | 2482 | 28.4% |
Common
| Value | Count | Frequency (%) |
| 383 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9117 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 1267 | 13.9% |
| d | 723 | 7.9% |
| h | 708 | 7.8% |
| t | 583 | 6.4% |
| o | 557 | 6.1% |
| a | 557 | 6.1% |
| c | 495 | 5.4% |
| S | 454 | 5.0% |
| m | 454 | 5.0% |
| w | 454 | 5.0% |
| Other values (13) | 2865 | 31.4% |
| Distinct | 5 |
|---|
| Distinct (%) | 0.5% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 8.3 KiB |
|---|
Length
| Max length | 10 |
|---|
| Median length | 9 |
|---|
| Mean length | 8.615019011 |
|---|
| Min length | 8 |
|---|
Characters and Unicode
| Total characters | 9063 |
|---|
| Distinct characters | 23 |
|---|
| Distinct categories | 3 ? |
|---|
| Distinct scripts | 2 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Sample
| 1st row | Somewhat |
|---|
| 2nd row | Somewhat |
|---|
| 3rd row | Somewhat |
|---|
| 4th row | Undecided |
|---|
| 5th row | Very Much |
|---|
| Value | Count | Frequency (%) |
| somewhat | 467 | 30.9% |
| very | 382 | 25.3% |
| much | 382 | 25.3% |
| undecided | 141 | 9.3% |
| not | 62 | 4.1% |
| really | 46 | 3.0% |
| at | 16 | 1.1% |
| all | 16 | 1.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1177 | 13.0% |
| h | 849 | 9.4% |
| t | 545 | 6.0% |
| o | 529 | 5.8% |
| a | 529 | 5.8% |
| c | 523 | 5.8% |
| S | 467 | 5.2% |
| m | 467 | 5.2% |
| w | 467 | 5.2% |
| 460 | 5.1% |
| Other values (13) | 3050 | 33.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7107 | 78.4% |
| Uppercase Letter | 1496 | 16.5% |
| Space Separator | 460 | 5.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1177 | 16.6% |
| h | 849 | 11.9% |
| t | 545 | 7.7% |
| o | 529 | 7.4% |
| a | 529 | 7.4% |
| c | 523 | 7.4% |
| m | 467 | 6.6% |
| w | 467 | 6.6% |
| y | 428 | 6.0% |
| d | 423 | 6.0% |
| Other values (5) | 1170 | 16.5% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 467 | 31.2% |
| M | 382 | 25.5% |
| V | 382 | 25.5% |
| U | 141 | 9.4% |
| N | 62 | 4.1% |
| R | 46 | 3.1% |
| A | 16 | 1.1% |
Space Separator
| Value | Count | Frequency (%) |
| 460 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8603 | 94.9% |
| Common | 460 | 5.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1177 | 13.7% |
| h | 849 | 9.9% |
| t | 545 | 6.3% |
| o | 529 | 6.1% |
| a | 529 | 6.1% |
| c | 523 | 6.1% |
| S | 467 | 5.4% |
| m | 467 | 5.4% |
| w | 467 | 5.4% |
| y | 428 | 5.0% |
| Other values (12) | 2622 | 30.5% |
Common
| Value | Count | Frequency (%) |
| 460 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9063 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 1177 | 13.0% |
| h | 849 | 9.4% |
| t | 545 | 6.0% |
| o | 529 | 5.8% |
| a | 529 | 5.8% |
| c | 523 | 5.8% |
| S | 467 | 5.2% |
| m | 467 | 5.2% |
| w | 467 | 5.2% |
| 460 | 5.1% |
| Other values (13) | 3050 | 33.7% |
| Distinct | 5 |
|---|
| Distinct (%) | 0.5% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 8.3 KiB |
|---|
Length
| Max length | 10 |
|---|
| Median length | 9 |
|---|
| Mean length | 8.659695817 |
|---|
| Min length | 8 |
|---|
Characters and Unicode
| Total characters | 9110 |
|---|
| Distinct characters | 23 |
|---|
| Distinct categories | 3 ? |
|---|
| Distinct scripts | 2 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Sample
| 1st row | Somewhat |
|---|
| 2nd row | Somewhat |
|---|
| 3rd row | Very Much |
|---|
| 4th row | Undecided |
|---|
| 5th row | Very Much |
|---|
| Value | Count | Frequency (%) |
| somewhat | 445 | 30.9% |
| very | 278 | 19.3% |
| much | 278 | 19.3% |
| undecided | 242 | 16.8% |
| not | 87 | 6.0% |
| really | 62 | 4.3% |
| at | 25 | 1.7% |
| all | 25 | 1.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1269 | 13.9% |
| d | 726 | 8.0% |
| h | 723 | 7.9% |
| t | 557 | 6.1% |
| o | 532 | 5.8% |
| a | 532 | 5.8% |
| c | 520 | 5.7% |
| S | 445 | 4.9% |
| m | 445 | 4.9% |
| w | 445 | 4.9% |
| Other values (13) | 2916 | 32.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7303 | 80.2% |
| Uppercase Letter | 1417 | 15.6% |
| Space Separator | 390 | 4.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1269 | 17.4% |
| d | 726 | 9.9% |
| h | 723 | 9.9% |
| t | 557 | 7.6% |
| o | 532 | 7.3% |
| a | 532 | 7.3% |
| c | 520 | 7.1% |
| m | 445 | 6.1% |
| w | 445 | 6.1% |
| y | 340 | 4.7% |
| Other values (5) | 1214 | 16.6% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 445 | 31.4% |
| M | 278 | 19.6% |
| V | 278 | 19.6% |
| U | 242 | 17.1% |
| N | 87 | 6.1% |
| R | 62 | 4.4% |
| A | 25 | 1.8% |
Space Separator
| Value | Count | Frequency (%) |
| 390 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8720 | 95.7% |
| Common | 390 | 4.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1269 | 14.6% |
| d | 726 | 8.3% |
| h | 723 | 8.3% |
| t | 557 | 6.4% |
| o | 532 | 6.1% |
| a | 532 | 6.1% |
| c | 520 | 6.0% |
| S | 445 | 5.1% |
| m | 445 | 5.1% |
| w | 445 | 5.1% |
| Other values (12) | 2526 | 29.0% |
Common
| Value | Count | Frequency (%) |
| 390 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9110 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 1269 | 13.9% |
| d | 726 | 8.0% |
| h | 723 | 7.9% |
| t | 557 | 6.1% |
| o | 532 | 5.8% |
| a | 532 | 5.8% |
| c | 520 | 5.7% |
| S | 445 | 4.9% |
| m | 445 | 4.9% |
| w | 445 | 4.9% |
| Other values (13) | 2916 | 32.0% |
| Distinct | 5 |
|---|
| Distinct (%) | 0.5% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 8.3 KiB |
|---|
Length
| Max length | 10 |
|---|
| Median length | 9 |
|---|
| Mean length | 8.655893536 |
|---|
| Min length | 8 |
|---|
Characters and Unicode
| Total characters | 9106 |
|---|
| Distinct characters | 23 |
|---|
| Distinct categories | 3 ? |
|---|
| Distinct scripts | 2 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Sample
| 1st row | Somewhat |
|---|
| 2nd row | Somewhat |
|---|
| 3rd row | Very Much |
|---|
| 4th row | Undecided |
|---|
| 5th row | Somewhat |
|---|
| Value | Count | Frequency (%) |
| somewhat | 460 | 31.9% |
| very | 266 | 18.4% |
| much | 266 | 18.4% |
| undecided | 228 | 15.8% |
| not | 98 | 6.8% |
| really | 70 | 4.8% |
| at | 28 | 1.9% |
| all | 28 | 1.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1252 | 13.7% |
| h | 726 | 8.0% |
| d | 684 | 7.5% |
| t | 586 | 6.4% |
| o | 558 | 6.1% |
| a | 558 | 6.1% |
| c | 494 | 5.4% |
| S | 460 | 5.1% |
| m | 460 | 5.1% |
| w | 460 | 5.1% |
| Other values (13) | 2868 | 31.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7298 | 80.1% |
| Uppercase Letter | 1416 | 15.6% |
| Space Separator | 392 | 4.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1252 | 17.2% |
| h | 726 | 9.9% |
| d | 684 | 9.4% |
| t | 586 | 8.0% |
| o | 558 | 7.6% |
| a | 558 | 7.6% |
| c | 494 | 6.8% |
| m | 460 | 6.3% |
| w | 460 | 6.3% |
| y | 336 | 4.6% |
| Other values (5) | 1184 | 16.2% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 460 | 32.5% |
| M | 266 | 18.8% |
| V | 266 | 18.8% |
| U | 228 | 16.1% |
| N | 98 | 6.9% |
| R | 70 | 4.9% |
| A | 28 | 2.0% |
Space Separator
| Value | Count | Frequency (%) |
| 392 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8714 | 95.7% |
| Common | 392 | 4.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1252 | 14.4% |
| h | 726 | 8.3% |
| d | 684 | 7.8% |
| t | 586 | 6.7% |
| o | 558 | 6.4% |
| a | 558 | 6.4% |
| c | 494 | 5.7% |
| S | 460 | 5.3% |
| m | 460 | 5.3% |
| w | 460 | 5.3% |
| Other values (12) | 2476 | 28.4% |
Common
| Value | Count | Frequency (%) |
| 392 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9106 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 1252 | 13.7% |
| h | 726 | 8.0% |
| d | 684 | 7.5% |
| t | 586 | 6.4% |
| o | 558 | 6.1% |
| a | 558 | 6.1% |
| c | 494 | 5.4% |
| S | 460 | 5.1% |
| m | 460 | 5.1% |
| w | 460 | 5.1% |
| Other values (13) | 2868 | 31.5% |
| Distinct | 5 |
|---|
| Distinct (%) | 0.5% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 8.3 KiB |
|---|
Length
| Max length | 10 |
|---|
| Median length | 9 |
|---|
| Mean length | 8.727186312 |
|---|
| Min length | 8 |
|---|
Characters and Unicode
| Total characters | 9181 |
|---|
| Distinct characters | 23 |
|---|
| Distinct categories | 3 ? |
|---|
| Distinct scripts | 2 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Sample
| 1st row | Somewhat |
|---|
| 2nd row | Somewhat |
|---|
| 3rd row | Very Much |
|---|
| 4th row | Undecided |
|---|
| 5th row | Somewhat |
|---|
| Value | Count | Frequency (%) |
| somewhat | 410 | 28.4% |
| undecided | 285 | 19.8% |
| very | 234 | 16.2% |
| much | 234 | 16.2% |
| not | 123 | 8.5% |
| really | 89 | 6.2% |
| at | 34 | 2.4% |
| all | 34 | 2.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1303 | 14.2% |
| d | 855 | 9.3% |
| h | 644 | 7.0% |
| t | 567 | 6.2% |
| o | 533 | 5.8% |
| a | 533 | 5.8% |
| c | 519 | 5.7% |
| S | 410 | 4.5% |
| w | 410 | 4.5% |
| m | 410 | 4.5% |
| Other values (13) | 2997 | 32.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7381 | 80.4% |
| Uppercase Letter | 1409 | 15.3% |
| Space Separator | 391 | 4.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1303 | 17.7% |
| d | 855 | 11.6% |
| h | 644 | 8.7% |
| t | 567 | 7.7% |
| o | 533 | 7.2% |
| a | 533 | 7.2% |
| c | 519 | 7.0% |
| w | 410 | 5.6% |
| m | 410 | 5.6% |
| y | 323 | 4.4% |
| Other values (5) | 1284 | 17.4% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 410 | 29.1% |
| U | 285 | 20.2% |
| V | 234 | 16.6% |
| M | 234 | 16.6% |
| N | 123 | 8.7% |
| R | 89 | 6.3% |
| A | 34 | 2.4% |
Space Separator
| Value | Count | Frequency (%) |
| 391 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8790 | 95.7% |
| Common | 391 | 4.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1303 | 14.8% |
| d | 855 | 9.7% |
| h | 644 | 7.3% |
| t | 567 | 6.5% |
| o | 533 | 6.1% |
| a | 533 | 6.1% |
| c | 519 | 5.9% |
| S | 410 | 4.7% |
| w | 410 | 4.7% |
| m | 410 | 4.7% |
| Other values (12) | 2606 | 29.6% |
Common
| Value | Count | Frequency (%) |
| 391 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9181 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 1303 | 14.2% |
| d | 855 | 9.3% |
| h | 644 | 7.0% |
| t | 567 | 6.2% |
| o | 533 | 5.8% |
| a | 533 | 5.8% |
| c | 519 | 5.7% |
| S | 410 | 4.5% |
| w | 410 | 4.5% |
| m | 410 | 4.5% |
| Other values (13) | 2997 | 32.6% |
| Distinct | 5 |
|---|
| Distinct (%) | 0.5% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 8.3 KiB |
|---|
Length
| Max length | 10 |
|---|
| Median length | 9 |
|---|
| Mean length | 8.741444867 |
|---|
| Min length | 8 |
|---|
Characters and Unicode
| Total characters | 9196 |
|---|
| Distinct characters | 23 |
|---|
| Distinct categories | 3 ? |
|---|
| Distinct scripts | 2 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Sample
| 1st row | Somewhat |
|---|
| 2nd row | Somewhat |
|---|
| 3rd row | Somewhat |
|---|
| 4th row | Somewhat |
|---|
| 5th row | Somewhat |
|---|
| Value | Count | Frequency (%) |
| somewhat | 395 | 28.7% |
| undecided | 370 | 26.9% |
| very | 164 | 11.9% |
| much | 164 | 11.9% |
| not | 123 | 9.0% |
| really | 88 | 6.4% |
| at | 35 | 2.5% |
| all | 35 | 2.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1387 | 15.1% |
| d | 1110 | 12.1% |
| h | 559 | 6.1% |
| t | 553 | 6.0% |
| c | 534 | 5.8% |
| o | 518 | 5.6% |
| a | 518 | 5.6% |
| S | 395 | 4.3% |
| w | 395 | 4.3% |
| m | 395 | 4.3% |
| Other values (13) | 2832 | 30.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7535 | 81.9% |
| Uppercase Letter | 1339 | 14.6% |
| Space Separator | 322 | 3.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1387 | 18.4% |
| d | 1110 | 14.7% |
| h | 559 | 7.4% |
| t | 553 | 7.3% |
| c | 534 | 7.1% |
| o | 518 | 6.9% |
| a | 518 | 6.9% |
| w | 395 | 5.2% |
| m | 395 | 5.2% |
| n | 370 | 4.9% |
| Other values (5) | 1196 | 15.9% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 395 | 29.5% |
| U | 370 | 27.6% |
| V | 164 | 12.2% |
| M | 164 | 12.2% |
| N | 123 | 9.2% |
| R | 88 | 6.6% |
| A | 35 | 2.6% |
Space Separator
| Value | Count | Frequency (%) |
| 322 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8874 | 96.5% |
| Common | 322 | 3.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1387 | 15.6% |
| d | 1110 | 12.5% |
| h | 559 | 6.3% |
| t | 553 | 6.2% |
| c | 534 | 6.0% |
| o | 518 | 5.8% |
| a | 518 | 5.8% |
| S | 395 | 4.5% |
| w | 395 | 4.5% |
| m | 395 | 4.5% |
| Other values (12) | 2510 | 28.3% |
Common
| Value | Count | Frequency (%) |
| 322 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9196 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 1387 | 15.1% |
| d | 1110 | 12.1% |
| h | 559 | 6.1% |
| t | 553 | 6.0% |
| c | 534 | 5.8% |
| o | 518 | 5.6% |
| a | 518 | 5.6% |
| S | 395 | 4.3% |
| w | 395 | 4.3% |
| m | 395 | 4.3% |
| Other values (13) | 2832 | 30.8% |
| Distinct | 5 |
|---|
| Distinct (%) | 0.5% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 8.3 KiB |
|---|
Length
| Max length | 10 |
|---|
| Median length | 9 |
|---|
| Mean length | 8.692965779 |
|---|
| Min length | 8 |
|---|
Characters and Unicode
| Total characters | 9145 |
|---|
| Distinct characters | 23 |
|---|
| Distinct categories | 3 ? |
|---|
| Distinct scripts | 2 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Sample
| 1st row | Somewhat |
|---|
| 2nd row | Somewhat |
|---|
| 3rd row | Somewhat |
|---|
| 4th row | Somewhat |
|---|
| 5th row | Very Much |
|---|
| Value | Count | Frequency (%) |
| somewhat | 432 | 28.9% |
| very | 304 | 20.4% |
| much | 304 | 20.4% |
| undecided | 207 | 13.9% |
| not | 109 | 7.3% |
| really | 81 | 5.4% |
| at | 28 | 1.9% |
| all | 28 | 1.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1231 | 13.5% |
| h | 736 | 8.0% |
| d | 621 | 6.8% |
| t | 569 | 6.2% |
| o | 541 | 5.9% |
| a | 541 | 5.9% |
| c | 511 | 5.6% |
| 441 | 4.8% |
| S | 432 | 4.7% |
| w | 432 | 4.7% |
| Other values (13) | 3090 | 33.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7239 | 79.2% |
| Uppercase Letter | 1465 | 16.0% |
| Space Separator | 441 | 4.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1231 | 17.0% |
| h | 736 | 10.2% |
| d | 621 | 8.6% |
| t | 569 | 7.9% |
| o | 541 | 7.5% |
| a | 541 | 7.5% |
| c | 511 | 7.1% |
| w | 432 | 6.0% |
| m | 432 | 6.0% |
| y | 385 | 5.3% |
| Other values (5) | 1240 | 17.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 432 | 29.5% |
| V | 304 | 20.8% |
| M | 304 | 20.8% |
| U | 207 | 14.1% |
| N | 109 | 7.4% |
| R | 81 | 5.5% |
| A | 28 | 1.9% |
Space Separator
| Value | Count | Frequency (%) |
| 441 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8704 | 95.2% |
| Common | 441 | 4.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1231 | 14.1% |
| h | 736 | 8.5% |
| d | 621 | 7.1% |
| t | 569 | 6.5% |
| o | 541 | 6.2% |
| a | 541 | 6.2% |
| c | 511 | 5.9% |
| S | 432 | 5.0% |
| w | 432 | 5.0% |
| m | 432 | 5.0% |
| Other values (12) | 2658 | 30.5% |
Common
| Value | Count | Frequency (%) |
| 441 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9145 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 1231 | 13.5% |
| h | 736 | 8.0% |
| d | 621 | 6.8% |
| t | 569 | 6.2% |
| o | 541 | 5.9% |
| a | 541 | 5.9% |
| c | 511 | 5.6% |
| 441 | 4.8% |
| S | 432 | 4.7% |
| w | 432 | 4.7% |
| Other values (13) | 3090 | 33.8% |
| Distinct | 5 |
|---|
| Distinct (%) | 0.5% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 8.3 KiB |
|---|
Length
| Max length | 10 |
|---|
| Median length | 9 |
|---|
| Mean length | 8.779467681 |
|---|
| Min length | 8 |
|---|
Characters and Unicode
| Total characters | 9236 |
|---|
| Distinct characters | 23 |
|---|
| Distinct categories | 3 ? |
|---|
| Distinct scripts | 2 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Sample
| 1st row | Somewhat |
|---|
| 2nd row | Somewhat |
|---|
| 3rd row | Somewhat |
|---|
| 4th row | Very Much |
|---|
| 5th row | Somewhat |
|---|
| Value | Count | Frequency (%) |
| somewhat | 417 | 27.1% |
| very | 235 | 15.3% |
| much | 235 | 15.3% |
| undecided | 215 | 14.0% |
| not | 185 | 12.0% |
| really | 117 | 7.6% |
| at | 68 | 4.4% |
| all | 68 | 4.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1199 | 13.0% |
| t | 670 | 7.3% |
| h | 652 | 7.1% |
| d | 645 | 7.0% |
| o | 602 | 6.5% |
| a | 602 | 6.5% |
| 488 | 5.3% |
| c | 450 | 4.9% |
| S | 417 | 4.5% |
| w | 417 | 4.5% |
| Other values (13) | 3094 | 33.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7276 | 78.8% |
| Uppercase Letter | 1472 | 15.9% |
| Space Separator | 488 | 5.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1199 | 16.5% |
| t | 670 | 9.2% |
| h | 652 | 9.0% |
| d | 645 | 8.9% |
| o | 602 | 8.3% |
| a | 602 | 8.3% |
| c | 450 | 6.2% |
| w | 417 | 5.7% |
| m | 417 | 5.7% |
| l | 370 | 5.1% |
| Other values (5) | 1252 | 17.2% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 417 | 28.3% |
| V | 235 | 16.0% |
| M | 235 | 16.0% |
| U | 215 | 14.6% |
| N | 185 | 12.6% |
| R | 117 | 7.9% |
| A | 68 | 4.6% |
Space Separator
| Value | Count | Frequency (%) |
| 488 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8748 | 94.7% |
| Common | 488 | 5.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1199 | 13.7% |
| t | 670 | 7.7% |
| h | 652 | 7.5% |
| d | 645 | 7.4% |
| o | 602 | 6.9% |
| a | 602 | 6.9% |
| c | 450 | 5.1% |
| S | 417 | 4.8% |
| w | 417 | 4.8% |
| m | 417 | 4.8% |
| Other values (12) | 2677 | 30.6% |
Common
| Value | Count | Frequency (%) |
| 488 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9236 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 1199 | 13.0% |
| t | 670 | 7.3% |
| h | 652 | 7.1% |
| d | 645 | 7.0% |
| o | 602 | 6.5% |
| a | 602 | 6.5% |
| 488 | 5.3% |
| c | 450 | 4.9% |
| S | 417 | 4.5% |
| w | 417 | 4.5% |
| Other values (13) | 3094 | 33.5% |
| Distinct | 5 |
|---|
| Distinct (%) | 0.5% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 8.3 KiB |
|---|
Length
| Max length | 10 |
|---|
| Median length | 9 |
|---|
| Mean length | 8.689163498 |
|---|
| Min length | 8 |
|---|
Characters and Unicode
| Total characters | 9141 |
|---|
| Distinct characters | 23 |
|---|
| Distinct categories | 3 ? |
|---|
| Distinct scripts | 2 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Sample
| 1st row | Somewhat |
|---|
| 2nd row | Somewhat |
|---|
| 3rd row | Very Much |
|---|
| 4th row | Very Much |
|---|
| 5th row | Somewhat |
|---|
| Value | Count | Frequency (%) |
| somewhat | 408 | 26.3% |
| very | 392 | 25.3% |
| much | 392 | 25.3% |
| undecided | 171 | 11.0% |
| not | 81 | 5.2% |
| really | 57 | 3.7% |
| at | 24 | 1.5% |
| all | 24 | 1.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1199 | 13.1% |
| h | 800 | 8.8% |
| c | 563 | 6.2% |
| d | 513 | 5.6% |
| t | 513 | 5.6% |
| 497 | 5.4% |
| a | 489 | 5.3% |
| o | 489 | 5.3% |
| y | 449 | 4.9% |
| S | 408 | 4.5% |
| Other values (13) | 3221 | 35.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7119 | 77.9% |
| Uppercase Letter | 1525 | 16.7% |
| Space Separator | 497 | 5.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1199 | 16.8% |
| h | 800 | 11.2% |
| c | 563 | 7.9% |
| d | 513 | 7.2% |
| t | 513 | 7.2% |
| a | 489 | 6.9% |
| o | 489 | 6.9% |
| y | 449 | 6.3% |
| w | 408 | 5.7% |
| m | 408 | 5.7% |
| Other values (5) | 1288 | 18.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 408 | 26.8% |
| V | 392 | 25.7% |
| M | 392 | 25.7% |
| U | 171 | 11.2% |
| N | 81 | 5.3% |
| R | 57 | 3.7% |
| A | 24 | 1.6% |
Space Separator
| Value | Count | Frequency (%) |
| 497 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8644 | 94.6% |
| Common | 497 | 5.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1199 | 13.9% |
| h | 800 | 9.3% |
| c | 563 | 6.5% |
| d | 513 | 5.9% |
| t | 513 | 5.9% |
| a | 489 | 5.7% |
| o | 489 | 5.7% |
| y | 449 | 5.2% |
| S | 408 | 4.7% |
| w | 408 | 4.7% |
| Other values (12) | 2813 | 32.5% |
Common
| Value | Count | Frequency (%) |
| 497 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9141 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 1199 | 13.1% |
| h | 800 | 8.8% |
| c | 563 | 6.2% |
| d | 513 | 5.6% |
| t | 513 | 5.6% |
| 497 | 5.4% |
| a | 489 | 5.3% |
| o | 489 | 5.3% |
| y | 449 | 4.9% |
| S | 408 | 4.5% |
| Other values (13) | 3221 | 35.2% |
| Distinct | 5 |
|---|
| Distinct (%) | 0.5% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 8.3 KiB |
|---|
Length
| Max length | 10 |
|---|
| Median length | 9 |
|---|
| Mean length | 8.715779468 |
|---|
| Min length | 8 |
|---|
Characters and Unicode
| Total characters | 9169 |
|---|
| Distinct characters | 23 |
|---|
| Distinct categories | 3 ? |
|---|
| Distinct scripts | 2 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Sample
| 1st row | Somewhat |
|---|
| 2nd row | Somewhat |
|---|
| 3rd row | Very Much |
|---|
| 4th row | Very Much |
|---|
| 5th row | Very Much |
|---|
| Value | Count | Frequency (%) |
| somewhat | 419 | 26.5% |
| very | 369 | 23.3% |
| much | 369 | 23.3% |
| undecided | 144 | 9.1% |
| not | 120 | 7.6% |
| really | 80 | 5.1% |
| at | 40 | 2.5% |
| all | 40 | 2.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1156 | 12.6% |
| h | 788 | 8.6% |
| t | 579 | 6.3% |
| a | 539 | 5.9% |
| o | 539 | 5.9% |
| 529 | 5.8% |
| c | 513 | 5.6% |
| y | 449 | 4.9% |
| d | 432 | 4.7% |
| S | 419 | 4.6% |
| Other values (13) | 3226 | 35.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7099 | 77.4% |
| Uppercase Letter | 1541 | 16.8% |
| Space Separator | 529 | 5.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1156 | 16.3% |
| h | 788 | 11.1% |
| t | 579 | 8.2% |
| a | 539 | 7.6% |
| o | 539 | 7.6% |
| c | 513 | 7.2% |
| y | 449 | 6.3% |
| d | 432 | 6.1% |
| w | 419 | 5.9% |
| m | 419 | 5.9% |
| Other values (5) | 1266 | 17.8% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 419 | 27.2% |
| V | 369 | 23.9% |
| M | 369 | 23.9% |
| U | 144 | 9.3% |
| N | 120 | 7.8% |
| R | 80 | 5.2% |
| A | 40 | 2.6% |
Space Separator
| Value | Count | Frequency (%) |
| 529 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8640 | 94.2% |
| Common | 529 | 5.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1156 | 13.4% |
| h | 788 | 9.1% |
| t | 579 | 6.7% |
| a | 539 | 6.2% |
| o | 539 | 6.2% |
| c | 513 | 5.9% |
| y | 449 | 5.2% |
| d | 432 | 5.0% |
| S | 419 | 4.8% |
| w | 419 | 4.8% |
| Other values (12) | 2807 | 32.5% |
Common
| Value | Count | Frequency (%) |
| 529 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9169 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 1156 | 12.6% |
| h | 788 | 8.6% |
| t | 579 | 6.3% |
| a | 539 | 5.9% |
| o | 539 | 5.9% |
| 529 | 5.8% |
| c | 513 | 5.6% |
| y | 449 | 4.9% |
| d | 432 | 4.7% |
| S | 419 | 4.6% |
| Other values (13) | 3226 | 35.2% |
| Distinct | 5 |
|---|
| Distinct (%) | 0.5% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 8.3 KiB |
|---|
Length
| Max length | 10 |
|---|
| Median length | 9 |
|---|
| Mean length | 8.646387833 |
|---|
| Min length | 8 |
|---|
Characters and Unicode
| Total characters | 9096 |
|---|
| Distinct characters | 23 |
|---|
| Distinct categories | 3 ? |
|---|
| Distinct scripts | 2 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Sample
| 1st row | Somewhat |
|---|
| 2nd row | Somewhat |
|---|
| 3rd row | Very Much |
|---|
| 4th row | Somewhat |
|---|
| 5th row | Very Much |
|---|
| Value | Count | Frequency (%) |
| very | 488 | 30.6% |
| much | 488 | 30.6% |
| somewhat | 416 | 26.0% |
| undecided | 104 | 6.5% |
| not | 44 | 2.8% |
| really | 31 | 1.9% |
| at | 13 | 0.8% |
| all | 13 | 0.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1143 | 12.6% |
| h | 904 | 9.9% |
| c | 592 | 6.5% |
| 545 | 6.0% |
| y | 519 | 5.7% |
| V | 488 | 5.4% |
| r | 488 | 5.4% |
| M | 488 | 5.4% |
| u | 488 | 5.4% |
| t | 473 | 5.2% |
| Other values (13) | 2968 | 32.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 6967 | 76.6% |
| Uppercase Letter | 1584 | 17.4% |
| Space Separator | 545 | 6.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1143 | 16.4% |
| h | 904 | 13.0% |
| c | 592 | 8.5% |
| y | 519 | 7.4% |
| r | 488 | 7.0% |
| u | 488 | 7.0% |
| t | 473 | 6.8% |
| a | 460 | 6.6% |
| o | 460 | 6.6% |
| m | 416 | 6.0% |
| Other values (5) | 1024 | 14.7% |
Uppercase Letter
| Value | Count | Frequency (%) |
| V | 488 | 30.8% |
| M | 488 | 30.8% |
| S | 416 | 26.3% |
| U | 104 | 6.6% |
| N | 44 | 2.8% |
| R | 31 | 2.0% |
| A | 13 | 0.8% |
Space Separator
| Value | Count | Frequency (%) |
| 545 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8551 | 94.0% |
| Common | 545 | 6.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1143 | 13.4% |
| h | 904 | 10.6% |
| c | 592 | 6.9% |
| y | 519 | 6.1% |
| V | 488 | 5.7% |
| r | 488 | 5.7% |
| M | 488 | 5.7% |
| u | 488 | 5.7% |
| t | 473 | 5.5% |
| a | 460 | 5.4% |
| Other values (12) | 2508 | 29.3% |
Common
| Value | Count | Frequency (%) |
| 545 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9096 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 1143 | 12.6% |
| h | 904 | 9.9% |
| c | 592 | 6.5% |
| 545 | 6.0% |
| y | 519 | 5.7% |
| V | 488 | 5.4% |
| r | 488 | 5.4% |
| M | 488 | 5.4% |
| u | 488 | 5.4% |
| t | 473 | 5.2% |
| Other values (13) | 2968 | 32.6% |
| Distinct | 5 |
|---|
| Distinct (%) | 0.5% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 8.3 KiB |
|---|
Length
| Max length | 10 |
|---|
| Median length | 9 |
|---|
| Mean length | 8.77851711 |
|---|
| Min length | 8 |
|---|
Characters and Unicode
| Total characters | 9235 |
|---|
| Distinct characters | 23 |
|---|
| Distinct categories | 3 ? |
|---|
| Distinct scripts | 2 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Sample
| 1st row | Somewhat |
|---|
| 2nd row | Somewhat |
|---|
| 3rd row | Very Much |
|---|
| 4th row | Somewhat |
|---|
| 5th row | Very Much |
|---|
| Value | Count | Frequency (%) |
| somewhat | 420 | 27.2% |
| very | 238 | 15.4% |
| much | 238 | 15.4% |
| undecided | 207 | 13.4% |
| not | 187 | 12.1% |
| really | 121 | 7.8% |
| at | 66 | 4.3% |
| all | 66 | 4.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1193 | 12.9% |
| t | 673 | 7.3% |
| h | 658 | 7.1% |
| d | 621 | 6.7% |
| o | 607 | 6.6% |
| a | 607 | 6.6% |
| 491 | 5.3% |
| c | 445 | 4.8% |
| S | 420 | 4.5% |
| w | 420 | 4.5% |
| Other values (13) | 3100 | 33.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7267 | 78.7% |
| Uppercase Letter | 1477 | 16.0% |
| Space Separator | 491 | 5.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1193 | 16.4% |
| t | 673 | 9.3% |
| h | 658 | 9.1% |
| d | 621 | 8.5% |
| o | 607 | 8.4% |
| a | 607 | 8.4% |
| c | 445 | 6.1% |
| w | 420 | 5.8% |
| m | 420 | 5.8% |
| l | 374 | 5.1% |
| Other values (5) | 1249 | 17.2% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 420 | 28.4% |
| V | 238 | 16.1% |
| M | 238 | 16.1% |
| U | 207 | 14.0% |
| N | 187 | 12.7% |
| R | 121 | 8.2% |
| A | 66 | 4.5% |
Space Separator
| Value | Count | Frequency (%) |
| 491 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8744 | 94.7% |
| Common | 491 | 5.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1193 | 13.6% |
| t | 673 | 7.7% |
| h | 658 | 7.5% |
| d | 621 | 7.1% |
| o | 607 | 6.9% |
| a | 607 | 6.9% |
| c | 445 | 5.1% |
| S | 420 | 4.8% |
| w | 420 | 4.8% |
| m | 420 | 4.8% |
| Other values (12) | 2680 | 30.6% |
Common
| Value | Count | Frequency (%) |
| 491 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9235 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 1193 | 12.9% |
| t | 673 | 7.3% |
| h | 658 | 7.1% |
| d | 621 | 6.7% |
| o | 607 | 6.6% |
| a | 607 | 6.6% |
| 491 | 5.3% |
| c | 445 | 4.8% |
| S | 420 | 4.5% |
| w | 420 | 4.5% |
| Other values (13) | 3100 | 33.6% |
| Distinct | 5 |
|---|
| Distinct (%) | 0.5% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 8.3 KiB |
|---|
Length
| Max length | 10 |
|---|
| Median length | 9 |
|---|
| Mean length | 8.682509506 |
|---|
| Min length | 8 |
|---|
Characters and Unicode
| Total characters | 9134 |
|---|
| Distinct characters | 23 |
|---|
| Distinct categories | 3 ? |
|---|
| Distinct scripts | 2 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Sample
| 1st row | Somewhat |
|---|
| 2nd row | Somewhat |
|---|
| 3rd row | Very Much |
|---|
| 4th row | Somewhat |
|---|
| 5th row | Somewhat |
|---|
| Value | Count | Frequency (%) |
| very | 518 | 31.7% |
| much | 518 | 31.7% |
| somewhat | 381 | 23.3% |
| undecided | 106 | 6.5% |
| not | 47 | 2.9% |
| really | 32 | 2.0% |
| at | 15 | 0.9% |
| all | 15 | 0.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1143 | 12.5% |
| h | 899 | 9.8% |
| c | 624 | 6.8% |
| 580 | 6.3% |
| y | 550 | 6.0% |
| V | 518 | 5.7% |
| r | 518 | 5.7% |
| M | 518 | 5.7% |
| u | 518 | 5.7% |
| t | 443 | 4.9% |
| Other values (13) | 2823 | 30.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 6937 | 75.9% |
| Uppercase Letter | 1617 | 17.7% |
| Space Separator | 580 | 6.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1143 | 16.5% |
| h | 899 | 13.0% |
| c | 624 | 9.0% |
| y | 550 | 7.9% |
| r | 518 | 7.5% |
| u | 518 | 7.5% |
| t | 443 | 6.4% |
| a | 428 | 6.2% |
| o | 428 | 6.2% |
| m | 381 | 5.5% |
| Other values (5) | 1005 | 14.5% |
Uppercase Letter
| Value | Count | Frequency (%) |
| V | 518 | 32.0% |
| M | 518 | 32.0% |
| S | 381 | 23.6% |
| U | 106 | 6.6% |
| N | 47 | 2.9% |
| R | 32 | 2.0% |
| A | 15 | 0.9% |
Space Separator
| Value | Count | Frequency (%) |
| 580 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8554 | 93.7% |
| Common | 580 | 6.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1143 | 13.4% |
| h | 899 | 10.5% |
| c | 624 | 7.3% |
| y | 550 | 6.4% |
| V | 518 | 6.1% |
| r | 518 | 6.1% |
| M | 518 | 6.1% |
| u | 518 | 6.1% |
| t | 443 | 5.2% |
| a | 428 | 5.0% |
| Other values (12) | 2395 | 28.0% |
Common
| Value | Count | Frequency (%) |
| 580 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9134 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 1143 | 12.5% |
| h | 899 | 9.8% |
| c | 624 | 6.8% |
| 580 | 6.3% |
| y | 550 | 6.0% |
| V | 518 | 5.7% |
| r | 518 | 5.7% |
| M | 518 | 5.7% |
| u | 518 | 5.7% |
| t | 443 | 4.9% |
| Other values (13) | 2823 | 30.9% |
| Distinct | 5 |
|---|
| Distinct (%) | 0.5% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 8.3 KiB |
|---|
Length
| Max length | 10 |
|---|
| Median length | 9 |
|---|
| Mean length | 8.69391635 |
|---|
| Min length | 8 |
|---|
Characters and Unicode
| Total characters | 9146 |
|---|
| Distinct characters | 23 |
|---|
| Distinct categories | 3 ? |
|---|
| Distinct scripts | 2 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Sample
| 1st row | Somewhat |
|---|
| 2nd row | Somewhat |
|---|
| 3rd row | Somewhat |
|---|
| 4th row | Very Much |
|---|
| 5th row | Somewhat |
|---|
| Value | Count | Frequency (%) |
| somewhat | 443 | 30.1% |
| very | 269 | 18.2% |
| much | 269 | 18.2% |
| undecided | 219 | 14.9% |
| not | 121 | 8.2% |
| really | 89 | 6.0% |
| at | 32 | 2.2% |
| all | 32 | 2.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1239 | 13.5% |
| h | 712 | 7.8% |
| d | 657 | 7.2% |
| t | 596 | 6.5% |
| o | 564 | 6.2% |
| a | 564 | 6.2% |
| c | 488 | 5.3% |
| S | 443 | 4.8% |
| m | 443 | 4.8% |
| w | 443 | 4.8% |
| Other values (13) | 2997 | 32.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7282 | 79.6% |
| Uppercase Letter | 1442 | 15.8% |
| Space Separator | 422 | 4.6% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1239 | 17.0% |
| h | 712 | 9.8% |
| d | 657 | 9.0% |
| t | 596 | 8.2% |
| o | 564 | 7.7% |
| a | 564 | 7.7% |
| c | 488 | 6.7% |
| m | 443 | 6.1% |
| w | 443 | 6.1% |
| y | 358 | 4.9% |
| Other values (5) | 1218 | 16.7% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 443 | 30.7% |
| M | 269 | 18.7% |
| V | 269 | 18.7% |
| U | 219 | 15.2% |
| N | 121 | 8.4% |
| R | 89 | 6.2% |
| A | 32 | 2.2% |
Space Separator
| Value | Count | Frequency (%) |
| 422 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8724 | 95.4% |
| Common | 422 | 4.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1239 | 14.2% |
| h | 712 | 8.2% |
| d | 657 | 7.5% |
| t | 596 | 6.8% |
| o | 564 | 6.5% |
| a | 564 | 6.5% |
| c | 488 | 5.6% |
| S | 443 | 5.1% |
| m | 443 | 5.1% |
| w | 443 | 5.1% |
| Other values (12) | 2575 | 29.5% |
Common
| Value | Count | Frequency (%) |
| 422 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9146 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 1239 | 13.5% |
| h | 712 | 7.8% |
| d | 657 | 7.2% |
| t | 596 | 6.5% |
| o | 564 | 6.2% |
| a | 564 | 6.2% |
| c | 488 | 5.3% |
| S | 443 | 4.8% |
| m | 443 | 4.8% |
| w | 443 | 4.8% |
| Other values (13) | 2997 | 32.8% |
| Distinct | 5 |
|---|
| Distinct (%) | 0.5% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 8.3 KiB |
|---|
Length
| Max length | 10 |
|---|
| Median length | 9 |
|---|
| Mean length | 8.719581749 |
|---|
| Min length | 8 |
|---|
Characters and Unicode
| Total characters | 9173 |
|---|
| Distinct characters | 23 |
|---|
| Distinct categories | 3 ? |
|---|
| Distinct scripts | 2 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Sample
| 1st row | Somewhat |
|---|
| 2nd row | Somewhat |
|---|
| 3rd row | Somewhat |
|---|
| 4th row | Undecided |
|---|
| 5th row | Very Much |
|---|
| Value | Count | Frequency (%) |
| very | 436 | 27.3% |
| much | 436 | 27.3% |
| somewhat | 378 | 23.6% |
| undecided | 155 | 9.7% |
| not | 83 | 5.2% |
| really | 55 | 3.4% |
| at | 28 | 1.8% |
| all | 28 | 1.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1179 | 12.9% |
| h | 814 | 8.9% |
| c | 591 | 6.4% |
| 547 | 6.0% |
| y | 491 | 5.4% |
| t | 489 | 5.3% |
| d | 465 | 5.1% |
| a | 461 | 5.0% |
| o | 461 | 5.0% |
| V | 436 | 4.8% |
| Other values (13) | 3239 | 35.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7055 | 76.9% |
| Uppercase Letter | 1571 | 17.1% |
| Space Separator | 547 | 6.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1179 | 16.7% |
| h | 814 | 11.5% |
| c | 591 | 8.4% |
| y | 491 | 7.0% |
| t | 489 | 6.9% |
| d | 465 | 6.6% |
| a | 461 | 6.5% |
| o | 461 | 6.5% |
| r | 436 | 6.2% |
| u | 436 | 6.2% |
| Other values (5) | 1232 | 17.5% |
Uppercase Letter
| Value | Count | Frequency (%) |
| V | 436 | 27.8% |
| M | 436 | 27.8% |
| S | 378 | 24.1% |
| U | 155 | 9.9% |
| N | 83 | 5.3% |
| R | 55 | 3.5% |
| A | 28 | 1.8% |
Space Separator
| Value | Count | Frequency (%) |
| 547 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8626 | 94.0% |
| Common | 547 | 6.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1179 | 13.7% |
| h | 814 | 9.4% |
| c | 591 | 6.9% |
| y | 491 | 5.7% |
| t | 489 | 5.7% |
| d | 465 | 5.4% |
| a | 461 | 5.3% |
| o | 461 | 5.3% |
| V | 436 | 5.1% |
| r | 436 | 5.1% |
| Other values (12) | 2803 | 32.5% |
Common
| Value | Count | Frequency (%) |
| 547 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9173 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 1179 | 12.9% |
| h | 814 | 8.9% |
| c | 591 | 6.4% |
| 547 | 6.0% |
| y | 491 | 5.4% |
| t | 489 | 5.3% |
| d | 465 | 5.1% |
| a | 461 | 5.0% |
| o | 461 | 5.0% |
| V | 436 | 4.8% |
| Other values (13) | 3239 | 35.3% |
| Distinct | 5 |
|---|
| Distinct (%) | 0.5% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 8.3 KiB |
|---|
Length
| Max length | 10 |
|---|
| Median length | 9 |
|---|
| Mean length | 8.719581749 |
|---|
| Min length | 8 |
|---|
Characters and Unicode
| Total characters | 9173 |
|---|
| Distinct characters | 23 |
|---|
| Distinct categories | 3 ? |
|---|
| Distinct scripts | 2 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Sample
| 1st row | Somewhat |
|---|
| 2nd row | Somewhat |
|---|
| 3rd row | Somewhat |
|---|
| 4th row | Undecided |
|---|
| 5th row | Somewhat |
|---|
| Value | Count | Frequency (%) |
| somewhat | 424 | 29.6% |
| undecided | 294 | 20.5% |
| very | 205 | 14.3% |
| much | 205 | 14.3% |
| not | 129 | 9.0% |
| really | 84 | 5.9% |
| at | 45 | 3.1% |
| all | 45 | 3.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1301 | 14.2% |
| d | 882 | 9.6% |
| h | 629 | 6.9% |
| t | 598 | 6.5% |
| o | 553 | 6.0% |
| a | 553 | 6.0% |
| c | 499 | 5.4% |
| S | 424 | 4.6% |
| w | 424 | 4.6% |
| m | 424 | 4.6% |
| Other values (13) | 2886 | 31.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7408 | 80.8% |
| Uppercase Letter | 1386 | 15.1% |
| Space Separator | 379 | 4.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1301 | 17.6% |
| d | 882 | 11.9% |
| h | 629 | 8.5% |
| t | 598 | 8.1% |
| o | 553 | 7.5% |
| a | 553 | 7.5% |
| c | 499 | 6.7% |
| w | 424 | 5.7% |
| m | 424 | 5.7% |
| n | 294 | 4.0% |
| Other values (5) | 1251 | 16.9% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 424 | 30.6% |
| U | 294 | 21.2% |
| V | 205 | 14.8% |
| M | 205 | 14.8% |
| N | 129 | 9.3% |
| R | 84 | 6.1% |
| A | 45 | 3.2% |
Space Separator
| Value | Count | Frequency (%) |
| 379 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8794 | 95.9% |
| Common | 379 | 4.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1301 | 14.8% |
| d | 882 | 10.0% |
| h | 629 | 7.2% |
| t | 598 | 6.8% |
| o | 553 | 6.3% |
| a | 553 | 6.3% |
| c | 499 | 5.7% |
| S | 424 | 4.8% |
| w | 424 | 4.8% |
| m | 424 | 4.8% |
| Other values (12) | 2507 | 28.5% |
Common
| Value | Count | Frequency (%) |
| 379 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9173 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 1301 | 14.2% |
| d | 882 | 9.6% |
| h | 629 | 6.9% |
| t | 598 | 6.5% |
| o | 553 | 6.0% |
| a | 553 | 6.0% |
| c | 499 | 5.4% |
| S | 424 | 4.6% |
| w | 424 | 4.6% |
| m | 424 | 4.6% |
| Other values (13) | 2886 | 31.5% |
| Distinct | 377 |
|---|
| Distinct (%) | 35.8% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 8.3 KiB |
|---|
Length
| Max length | 334 |
|---|
| Median length | 289 |
|---|
| Mean length | 18.81273764 |
|---|
| Min length | 1 |
|---|
Characters and Unicode
| Total characters | 19791 |
|---|
| Distinct characters | 68 |
|---|
| Distinct categories | 9 ? |
|---|
| Distinct scripts | 2 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Unique | 292 ? |
|---|
| Unique (%) | 27.8% |
|---|
Sample
| 1st row | test |
|---|
| 2nd row | Live class |
|---|
| 3rd row | Formative |
|---|
| 4th row | Individual assignment |
|---|
| 5th row | Portfolio |
|---|
| Value | Count | Frequency (%) |
| lecture | 517 | 16.8% |
| live | 505 | 16.4% |
| youtube | 402 | 13.0% |
| video | 318 | 10.3% |
| and | 93 | 3.0% |
| recorded | 69 | 2.2% |
| demonstration | 31 | 1.0% |
| or | 29 | 0.9% |
| pre-recorded | 29 | 0.9% |
| i | 28 | 0.9% |
| Other values (388) | 1063 | 34.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 3216 | 16.2% |
| 2172 | 11.0% |
| u | 1496 | 7.6% |
| o | 1289 | 6.5% |
| t | 1239 | 6.3% |
| i | 1209 | 6.1% |
| r | 1122 | 5.7% |
| c | 821 | 4.1% |
| d | 797 | 4.0% |
| l | 722 | 3.6% |
| Other values (58) | 5708 | 28.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 15350 | 77.6% |
| Space Separator | 2172 | 11.0% |
| Uppercase Letter | 2014 | 10.2% |
| Other Punctuation | 194 | 1.0% |
| Dash Punctuation | 47 | 0.2% |
| Open Punctuation | 5 | < 0.1% |
| Close Punctuation | 5 | < 0.1% |
| Decimal Number | 3 | < 0.1% |
| Math Symbol | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 3216 | 21.0% |
| u | 1496 | 9.7% |
| o | 1289 | 8.4% |
| t | 1239 | 8.1% |
| i | 1209 | 7.9% |
| r | 1122 | 7.3% |
| c | 821 | 5.3% |
| d | 797 | 5.2% |
| l | 722 | 4.7% |
| v | 714 | 4.7% |
| Other values (16) | 2725 | 17.8% |
Uppercase Letter
| Value | Count | Frequency (%) |
| L | 551 | 27.4% |
| Y | 363 | 18.0% |
| T | 200 | 9.9% |
| V | 168 | 8.3% |
| E | 96 | 4.8% |
| I | 79 | 3.9% |
| O | 63 | 3.1% |
| D | 60 | 3.0% |
| P | 55 | 2.7% |
| R | 51 | 2.5% |
| Other values (16) | 328 | 16.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 128 | 66.0% |
| . | 33 | 17.0% |
| / | 18 | 9.3% |
| & | 8 | 4.1% |
| ' | 3 | 1.5% |
| : | 2 | 1.0% |
| ? | 1 | 0.5% |
| @ | 1 | 0.5% |
Decimal Number
| Value | Count | Frequency (%) |
| 5 | 1 | 33.3% |
| 2 | 1 | 33.3% |
| 3 | 1 | 33.3% |
Space Separator
| Value | Count | Frequency (%) |
| 2172 | 100.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 47 | 100.0% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 5 | 100.0% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 5 | 100.0% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 1 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 17364 | 87.7% |
| Common | 2427 | 12.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 3216 | 18.5% |
| u | 1496 | 8.6% |
| o | 1289 | 7.4% |
| t | 1239 | 7.1% |
| i | 1209 | 7.0% |
| r | 1122 | 6.5% |
| c | 821 | 4.7% |
| d | 797 | 4.6% |
| l | 722 | 4.2% |
| v | 714 | 4.1% |
| Other values (42) | 4739 | 27.3% |
Common
| Value | Count | Frequency (%) |
| 2172 | 89.5% |
| , | 128 | 5.3% |
| - | 47 | 1.9% |
| . | 33 | 1.4% |
| / | 18 | 0.7% |
| & | 8 | 0.3% |
| ( | 5 | 0.2% |
| ) | 5 | 0.2% |
| ' | 3 | 0.1% |
| : | 2 | 0.1% |
| Other values (6) | 6 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 19791 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 3216 | 16.2% |
| 2172 | 11.0% |
| u | 1496 | 7.6% |
| o | 1289 | 6.5% |
| t | 1239 | 6.3% |
| i | 1209 | 6.1% |
| r | 1122 | 5.7% |
| c | 821 | 4.1% |
| d | 797 | 4.0% |
| l | 722 | 3.6% |
| Other values (58) | 5708 | 28.8% |
| Distinct | 2 |
|---|
| Distinct (%) | 0.2% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 8.3 KiB |
|---|
Length
| Max length | 3 |
|---|
| Median length | 3 |
|---|
| Mean length | 2.708174905 |
|---|
| Min length | 2 |
|---|
Characters and Unicode
| Total characters | 2849 |
|---|
| Distinct characters | 5 |
|---|
| Distinct categories | 2 ? |
|---|
| Distinct scripts | 1 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Sample
| 1st row | Yes |
|---|
| 2nd row | Yes |
|---|
| 3rd row | Yes |
|---|
| 4th row | Yes |
|---|
| 5th row | No |
|---|
| Value | Count | Frequency (%) |
| yes | 745 | 70.8% |
| no | 307 | 29.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| Y | 745 | 26.1% |
| e | 745 | 26.1% |
| s | 745 | 26.1% |
| N | 307 | 10.8% |
| o | 307 | 10.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1797 | 63.1% |
| Uppercase Letter | 1052 | 36.9% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 745 | 41.5% |
| s | 745 | 41.5% |
| o | 307 | 17.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| Y | 745 | 70.8% |
| N | 307 | 29.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2849 | 100.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| Y | 745 | 26.1% |
| e | 745 | 26.1% |
| s | 745 | 26.1% |
| N | 307 | 10.8% |
| o | 307 | 10.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2849 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| Y | 745 | 26.1% |
| e | 745 | 26.1% |
| s | 745 | 26.1% |
| N | 307 | 10.8% |
| o | 307 | 10.8% |
| Distinct | 2 |
|---|
| Distinct (%) | 0.2% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 8.3 KiB |
|---|
Length
| Max length | 3 |
|---|
| Median length | 3 |
|---|
| Mean length | 2.823193916 |
|---|
| Min length | 2 |
|---|
Characters and Unicode
| Total characters | 2970 |
|---|
| Distinct characters | 5 |
|---|
| Distinct categories | 2 ? |
|---|
| Distinct scripts | 1 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Sample
| 1st row | Yes |
|---|
| 2nd row | Yes |
|---|
| 3rd row | Yes |
|---|
| 4th row | No |
|---|
| 5th row | Yes |
|---|
| Value | Count | Frequency (%) |
| yes | 866 | 82.3% |
| no | 186 | 17.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| Y | 866 | 29.2% |
| e | 866 | 29.2% |
| s | 866 | 29.2% |
| N | 186 | 6.3% |
| o | 186 | 6.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1918 | 64.6% |
| Uppercase Letter | 1052 | 35.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 866 | 45.2% |
| s | 866 | 45.2% |
| o | 186 | 9.7% |
Uppercase Letter
| Value | Count | Frequency (%) |
| Y | 866 | 82.3% |
| N | 186 | 17.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2970 | 100.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| Y | 866 | 29.2% |
| e | 866 | 29.2% |
| s | 866 | 29.2% |
| N | 186 | 6.3% |
| o | 186 | 6.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2970 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| Y | 866 | 29.2% |
| e | 866 | 29.2% |
| s | 866 | 29.2% |
| N | 186 | 6.3% |
| o | 186 | 6.3% |
| Distinct | 2 |
|---|
| Distinct (%) | 0.2% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 8.3 KiB |
|---|
Length
| Max length | 3 |
|---|
| Median length | 3 |
|---|
| Mean length | 2.979087452 |
|---|
| Min length | 2 |
|---|
Characters and Unicode
| Total characters | 3134 |
|---|
| Distinct characters | 5 |
|---|
| Distinct categories | 2 ? |
|---|
| Distinct scripts | 1 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Sample
| 1st row | Yes |
|---|
| 2nd row | Yes |
|---|
| 3rd row | Yes |
|---|
| 4th row | Yes |
|---|
| 5th row | Yes |
|---|
| Value | Count | Frequency (%) |
| yes | 1030 | 97.9% |
| no | 22 | 2.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| Y | 1030 | 32.9% |
| e | 1030 | 32.9% |
| s | 1030 | 32.9% |
| N | 22 | 0.7% |
| o | 22 | 0.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2082 | 66.4% |
| Uppercase Letter | 1052 | 33.6% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1030 | 49.5% |
| s | 1030 | 49.5% |
| o | 22 | 1.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| Y | 1030 | 97.9% |
| N | 22 | 2.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3134 | 100.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| Y | 1030 | 32.9% |
| e | 1030 | 32.9% |
| s | 1030 | 32.9% |
| N | 22 | 0.7% |
| o | 22 | 0.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3134 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| Y | 1030 | 32.9% |
| e | 1030 | 32.9% |
| s | 1030 | 32.9% |
| N | 22 | 0.7% |
| o | 22 | 0.7% |
| Distinct | 3 |
|---|
| Distinct (%) | 0.3% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 8.3 KiB |
|---|
Length
| Max length | 40 |
|---|
| Median length | 21 |
|---|
| Mean length | 28.4134981 |
|---|
| Min length | 21 |
|---|
Characters and Unicode
| Total characters | 29891 |
|---|
| Distinct characters | 26 |
|---|
| Distinct categories | 4 ? |
|---|
| Distinct scripts | 2 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Sample
| 1st row | Read the instructions |
|---|
| 2nd row | Have a go and learn by "trial and error" |
|---|
| 3rd row | Read the instructions |
|---|
| 4th row | Read the instructions |
|---|
| 5th row | Listen to or ask for an explaination |
|---|
| Value | Count | Frequency (%) |
| read | 575 | 10.7% |
| instructions | 575 | 10.7% |
| the | 575 | 10.7% |
| and | 322 | 6.0% |
| for | 316 | 5.9% |
| an | 316 | 5.9% |
| explaination | 316 | 5.9% |
| ask | 316 | 5.9% |
| or | 316 | 5.9% |
| to | 316 | 5.9% |
| Other values (8) | 1443 | 26.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| 4334 | 14.5% |
| n | 2897 | 9.7% |
| t | 2834 | 9.5% |
| a | 2805 | 9.4% |
| e | 2265 | 7.6% |
| i | 2259 | 7.6% |
| o | 2161 | 7.2% |
| r | 2012 | 6.7% |
| s | 1782 | 6.0% |
| d | 897 | 3.0% |
| Other values (16) | 5645 | 18.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 24183 | 80.9% |
| Space Separator | 4334 | 14.5% |
| Uppercase Letter | 1052 | 3.5% |
| Other Punctuation | 322 | 1.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 2897 | 12.0% |
| t | 2834 | 11.7% |
| a | 2805 | 11.6% |
| e | 2265 | 9.4% |
| i | 2259 | 9.3% |
| o | 2161 | 8.9% |
| r | 2012 | 8.3% |
| s | 1782 | 7.4% |
| d | 897 | 3.7% |
| l | 638 | 2.6% |
| Other values (11) | 3633 | 15.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| R | 575 | 54.7% |
| L | 316 | 30.0% |
| H | 161 | 15.3% |
Space Separator
| Value | Count | Frequency (%) |
| 4334 | 100.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| " | 322 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 25235 | 84.4% |
| Common | 4656 | 15.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 2897 | 11.5% |
| t | 2834 | 11.2% |
| a | 2805 | 11.1% |
| e | 2265 | 9.0% |
| i | 2259 | 9.0% |
| o | 2161 | 8.6% |
| r | 2012 | 8.0% |
| s | 1782 | 7.1% |
| d | 897 | 3.6% |
| l | 638 | 2.5% |
| Other values (14) | 4685 | 18.6% |
Common
| Value | Count | Frequency (%) |
| 4334 | 93.1% |
| " | 322 | 6.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 29891 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 4334 | 14.5% |
| n | 2897 | 9.7% |
| t | 2834 | 9.5% |
| a | 2805 | 9.4% |
| e | 2265 | 7.6% |
| i | 2259 | 7.6% |
| o | 2161 | 7.2% |
| r | 2012 | 6.7% |
| s | 1782 | 6.0% |
| d | 897 | 3.0% |
| Other values (16) | 5645 | 18.9% |
| Distinct | 3 |
|---|
| Distinct (%) | 0.3% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 8.3 KiB |
|---|
Length
| Max length | 37 |
|---|
| Median length | 13 |
|---|
| Mean length | 17.09505703 |
|---|
| Min length | 13 |
|---|
Characters and Unicode
| Total characters | 17984 |
|---|
| Distinct characters | 23 |
|---|
| Distinct categories | 3 ? |
|---|
| Distinct scripts | 2 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Sample
| 1st row | Look at a map |
|---|
| 2nd row | Look at a map |
|---|
| 3rd row | Look at a map |
|---|
| 4th row | Ask for spoken directions |
|---|
| 5th row | Ask for spoken directions |
|---|
| Value | Count | Frequency (%) |
| a | 715 | 16.8% |
| look | 704 | 16.6% |
| at | 704 | 16.6% |
| map | 704 | 16.6% |
| ask | 337 | 7.9% |
| for | 337 | 7.9% |
| spoken | 337 | 7.9% |
| directions | 337 | 7.9% |
| follow | 11 | 0.3% |
| my | 11 | 0.3% |
| Other values (5) | 55 | 1.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| 3200 | 17.8% |
| o | 2474 | 13.8% |
| a | 2145 | 11.9% |
| k | 1378 | 7.7% |
| s | 1055 | 5.9% |
| p | 1052 | 5.8% |
| t | 1041 | 5.8% |
| m | 737 | 4.1% |
| e | 707 | 3.9% |
| L | 704 | 3.9% |
| Other values (13) | 3491 | 19.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 13732 | 76.4% |
| Space Separator | 3200 | 17.8% |
| Uppercase Letter | 1052 | 5.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 2474 | 18.0% |
| a | 2145 | 15.6% |
| k | 1378 | 10.0% |
| s | 1055 | 7.7% |
| p | 1052 | 7.7% |
| t | 1041 | 7.6% |
| m | 737 | 5.4% |
| e | 707 | 5.1% |
| n | 685 | 5.0% |
| r | 685 | 5.0% |
| Other values (9) | 1773 | 12.9% |
Uppercase Letter
| Value | Count | Frequency (%) |
| L | 704 | 66.9% |
| A | 337 | 32.0% |
| F | 11 | 1.0% |
Space Separator
| Value | Count | Frequency (%) |
| 3200 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 14784 | 82.2% |
| Common | 3200 | 17.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 2474 | 16.7% |
| a | 2145 | 14.5% |
| k | 1378 | 9.3% |
| s | 1055 | 7.1% |
| p | 1052 | 7.1% |
| t | 1041 | 7.0% |
| m | 737 | 5.0% |
| e | 707 | 4.8% |
| L | 704 | 4.8% |
| n | 685 | 4.6% |
| Other values (12) | 2806 | 19.0% |
Common
| Value | Count | Frequency (%) |
| 3200 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 17984 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 3200 | 17.8% |
| o | 2474 | 13.8% |
| a | 2145 | 11.9% |
| k | 1378 | 7.7% |
| s | 1055 | 5.9% |
| p | 1052 | 5.8% |
| t | 1041 | 5.8% |
| m | 737 | 4.1% |
| e | 707 | 3.9% |
| L | 704 | 3.9% |
| Other values (13) | 3491 | 19.4% |
| Distinct | 3 |
|---|
| Distinct (%) | 0.3% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 8.3 KiB |
|---|
Length
| Max length | 37 |
|---|
| Median length | 15 |
|---|
| Mean length | 20.11977186 |
|---|
| Min length | 15 |
|---|
Characters and Unicode
| Total characters | 21166 |
|---|
| Distinct characters | 24 |
|---|
| Distinct categories | 4 ? |
|---|
| Distinct scripts | 2 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Sample
| 1st row | Follow a recipe |
|---|
| 2nd row | Follow a recipe |
|---|
| 3rd row | Follow a recipe |
|---|
| 4th row | Follow a recipe |
|---|
| 5th row | Follow a recipe |
|---|
| Value | Count | Frequency (%) |
| follow | 976 | 23.9% |
| a | 859 | 21.1% |
| recipe | 783 | 19.2% |
| my | 193 | 4.7% |
| instinct | 193 | 4.7% |
| tasting | 193 | 4.7% |
| as | 193 | 4.7% |
| i | 193 | 4.7% |
| cook | 193 | 4.7% |
| call | 76 | 1.9% |
| Other values (3) | 228 | 5.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| 3028 | 14.3% |
| o | 2490 | 11.8% |
| l | 2180 | 10.3% |
| e | 1718 | 8.1% |
| i | 1590 | 7.5% |
| a | 1473 | 7.0% |
| c | 1169 | 5.5% |
| F | 976 | 4.6% |
| w | 976 | 4.6% |
| r | 935 | 4.4% |
| Other values (14) | 4631 | 21.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 16700 | 78.9% |
| Space Separator | 3028 | 14.3% |
| Uppercase Letter | 1245 | 5.9% |
| Other Punctuation | 193 | 0.9% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 2490 | 14.9% |
| l | 2180 | 13.1% |
| e | 1718 | 10.3% |
| i | 1590 | 9.5% |
| a | 1473 | 8.8% |
| c | 1169 | 7.0% |
| w | 976 | 5.8% |
| r | 935 | 5.6% |
| p | 859 | 5.1% |
| t | 848 | 5.1% |
| Other values (9) | 2462 | 14.7% |
Uppercase Letter
| Value | Count | Frequency (%) |
| F | 976 | 78.4% |
| I | 193 | 15.5% |
| C | 76 | 6.1% |
Space Separator
| Value | Count | Frequency (%) |
| 3028 | 100.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 193 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 17945 | 84.8% |
| Common | 3221 | 15.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 2490 | 13.9% |
| l | 2180 | 12.1% |
| e | 1718 | 9.6% |
| i | 1590 | 8.9% |
| a | 1473 | 8.2% |
| c | 1169 | 6.5% |
| F | 976 | 5.4% |
| w | 976 | 5.4% |
| r | 935 | 5.2% |
| p | 859 | 4.8% |
| Other values (12) | 3579 | 19.9% |
Common
| Value | Count | Frequency (%) |
| 3028 | 94.0% |
| , | 193 | 6.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 21166 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 3028 | 14.3% |
| o | 2490 | 11.8% |
| l | 2180 | 10.3% |
| e | 1718 | 8.1% |
| i | 1590 | 7.5% |
| a | 1473 | 7.0% |
| c | 1169 | 5.5% |
| F | 976 | 4.6% |
| w | 976 | 4.6% |
| r | 935 | 4.4% |
| Other values (14) | 4631 | 21.9% |
| Distinct | 3 |
|---|
| Distinct (%) | 0.3% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 8.3 KiB |
|---|
Length
| Max length | 34 |
|---|
| Median length | 16 |
|---|
| Mean length | 22.40114068 |
|---|
| Min length | 16 |
|---|
Characters and Unicode
| Total characters | 23566 |
|---|
| Distinct characters | 25 |
|---|
| Distinct categories | 3 ? |
|---|
| Distinct scripts | 2 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Sample
| 1st row | Write Instructions |
|---|
| 2nd row | Explain verbally |
|---|
| 3rd row | Demonstrate and let them have a go |
|---|
| 4th row | Demonstrate and let them have a go |
|---|
| 5th row | Explain verbally |
|---|
| Value | Count | Frequency (%) |
| explain | 573 | 14.7% |
| verbally | 573 | 14.7% |
| demonstrate | 361 | 9.2% |
| and | 361 | 9.2% |
| let | 361 | 9.2% |
| them | 361 | 9.2% |
| have | 361 | 9.2% |
| a | 361 | 9.2% |
| go | 361 | 9.2% |
| write | 118 | 3.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 2857 | 12.1% |
| a | 2590 | 11.0% |
| e | 2496 | 10.6% |
| l | 2080 | 8.8% |
| t | 1798 | 7.6% |
| n | 1531 | 6.5% |
| r | 1170 | 5.0% |
| v | 934 | 4.0% |
| o | 840 | 3.6% |
| i | 809 | 3.4% |
| Other values (15) | 6461 | 27.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 19539 | 82.9% |
| Space Separator | 2857 | 12.1% |
| Uppercase Letter | 1170 | 5.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 2590 | 13.3% |
| e | 2496 | 12.8% |
| l | 2080 | 10.6% |
| t | 1798 | 9.2% |
| n | 1531 | 7.8% |
| r | 1170 | 6.0% |
| v | 934 | 4.8% |
| o | 840 | 4.3% |
| i | 809 | 4.1% |
| m | 722 | 3.7% |
| Other values (10) | 4569 | 23.4% |
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 573 | 49.0% |
| D | 361 | 30.9% |
| W | 118 | 10.1% |
| I | 118 | 10.1% |
Space Separator
| Value | Count | Frequency (%) |
| 2857 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 20709 | 87.9% |
| Common | 2857 | 12.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 2590 | 12.5% |
| e | 2496 | 12.1% |
| l | 2080 | 10.0% |
| t | 1798 | 8.7% |
| n | 1531 | 7.4% |
| r | 1170 | 5.6% |
| v | 934 | 4.5% |
| o | 840 | 4.1% |
| i | 809 | 3.9% |
| m | 722 | 3.5% |
| Other values (14) | 5739 | 27.7% |
Common
| Value | Count | Frequency (%) |
| 2857 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 23566 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2857 | 12.1% |
| a | 2590 | 11.0% |
| e | 2496 | 10.6% |
| l | 2080 | 8.8% |
| t | 1798 | 7.6% |
| n | 1531 | 6.5% |
| r | 1170 | 5.0% |
| v | 934 | 4.0% |
| o | 840 | 3.6% |
| i | 809 | 3.4% |
| Other values (15) | 6461 | 27.4% |
| Distinct | 3 |
|---|
| Distinct (%) | 0.3% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 8.3 KiB |
|---|
Length
| Max length | 26 |
|---|
| Median length | 19 |
|---|
| Mean length | 20.13117871 |
|---|
| Min length | 19 |
|---|
Characters and Unicode
| Total characters | 21178 |
|---|
| Distinct characters | 19 |
|---|
| Distinct categories | 3 ? |
|---|
| Distinct scripts | 2 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Sample
| 1st row | I hear what you are saying |
|---|
| 2nd row | I know how you feel |
|---|
| 3rd row | I know how you feel |
|---|
| 4th row | I see what you mean |
|---|
| 5th row | I hear what you are saying |
|---|
| Value | Count | Frequency (%) |
| i | 1052 | 19.4% |
| you | 1052 | 19.4% |
| what | 596 | 11.0% |
| know | 456 | 8.4% |
| how | 456 | 8.4% |
| feel | 456 | 8.4% |
| see | 426 | 7.8% |
| mean | 426 | 7.8% |
| hear | 170 | 3.1% |
| are | 170 | 3.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 4378 | 20.7% |
| e | 2530 | 11.9% |
| o | 1964 | 9.3% |
| a | 1532 | 7.2% |
| w | 1508 | 7.1% |
| h | 1222 | 5.8% |
| y | 1222 | 5.8% |
| I | 1052 | 5.0% |
| n | 1052 | 5.0% |
| u | 1052 | 5.0% |
| Other values (9) | 3666 | 17.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 15748 | 74.4% |
| Space Separator | 4378 | 20.7% |
| Uppercase Letter | 1052 | 5.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 2530 | 16.1% |
| o | 1964 | 12.5% |
| a | 1532 | 9.7% |
| w | 1508 | 9.6% |
| h | 1222 | 7.8% |
| y | 1222 | 7.8% |
| n | 1052 | 6.7% |
| u | 1052 | 6.7% |
| s | 596 | 3.8% |
| t | 596 | 3.8% |
| Other values (7) | 2474 | 15.7% |
Space Separator
| Value | Count | Frequency (%) |
| 4378 | 100.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 1052 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 16800 | 79.3% |
| Common | 4378 | 20.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 2530 | 15.1% |
| o | 1964 | 11.7% |
| a | 1532 | 9.1% |
| w | 1508 | 9.0% |
| h | 1222 | 7.3% |
| y | 1222 | 7.3% |
| I | 1052 | 6.3% |
| n | 1052 | 6.3% |
| u | 1052 | 6.3% |
| s | 596 | 3.5% |
| Other values (8) | 3070 | 18.3% |
Common
| Value | Count | Frequency (%) |
| 4378 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 21178 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 4378 | 20.7% |
| e | 2530 | 11.9% |
| o | 1964 | 9.3% |
| a | 1532 | 7.2% |
| w | 1508 | 7.1% |
| h | 1222 | 5.8% |
| y | 1222 | 5.8% |
| I | 1052 | 5.0% |
| n | 1052 | 5.0% |
| u | 1052 | 5.0% |
| Other values (9) | 3666 | 17.3% |
| Distinct | 3 |
|---|
| Distinct (%) | 0.3% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 8.3 KiB |
|---|
Length
| Max length | 10 |
|---|
| Median length | 7 |
|---|
| Mean length | 8.328897338 |
|---|
| Min length | 7 |
|---|
Characters and Unicode
| Total characters | 8762 |
|---|
| Distinct characters | 13 |
|---|
| Distinct categories | 3 ? |
|---|
| Distinct scripts | 2 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Sample
| 1st row | Show me |
|---|
| 2nd row | Let me try |
|---|
| 3rd row | Let me try |
|---|
| 4th row | Tell me |
|---|
| 5th row | Let me try |
|---|
| Value | Count | Frequency (%) |
| me | 1052 | 40.9% |
| let | 466 | 18.1% |
| try | 466 | 18.1% |
| show | 315 | 12.3% |
| tell | 271 | 10.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1789 | 20.4% |
| 1518 | 17.3% |
| m | 1052 | 12.0% |
| t | 932 | 10.6% |
| l | 542 | 6.2% |
| L | 466 | 5.3% |
| r | 466 | 5.3% |
| y | 466 | 5.3% |
| S | 315 | 3.6% |
| h | 315 | 3.6% |
| Other values (3) | 901 | 10.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 6192 | 70.7% |
| Space Separator | 1518 | 17.3% |
| Uppercase Letter | 1052 | 12.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1789 | 28.9% |
| m | 1052 | 17.0% |
| t | 932 | 15.1% |
| l | 542 | 8.8% |
| r | 466 | 7.5% |
| y | 466 | 7.5% |
| h | 315 | 5.1% |
| o | 315 | 5.1% |
| w | 315 | 5.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| L | 466 | 44.3% |
| S | 315 | 29.9% |
| T | 271 | 25.8% |
Space Separator
| Value | Count | Frequency (%) |
| 1518 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 7244 | 82.7% |
| Common | 1518 | 17.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1789 | 24.7% |
| m | 1052 | 14.5% |
| t | 932 | 12.9% |
| l | 542 | 7.5% |
| L | 466 | 6.4% |
| r | 466 | 6.4% |
| y | 466 | 6.4% |
| S | 315 | 4.3% |
| h | 315 | 4.3% |
| o | 315 | 4.3% |
| Other values (2) | 586 | 8.1% |
Common
| Value | Count | Frequency (%) |
| 1518 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8762 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 1789 | 20.4% |
| 1518 | 17.3% |
| m | 1052 | 12.0% |
| t | 932 | 10.6% |
| l | 542 | 6.2% |
| L | 466 | 5.3% |
| r | 466 | 5.3% |
| y | 466 | 5.3% |
| S | 315 | 3.6% |
| h | 315 | 3.6% |
| Other values (3) | 901 | 10.3% |
| Distinct | 3 |
|---|
| Distinct (%) | 0.3% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 8.3 KiB |
|---|
Length
| Max length | 20 |
|---|
| Median length | 17 |
|---|
| Mean length | 17.70722433 |
|---|
| Min length | 13 |
|---|
Characters and Unicode
| Total characters | 18628 |
|---|
| Distinct characters | 23 |
|---|
| Distinct categories | 3 ? |
|---|
| Distinct scripts | 2 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Sample
| 1st row | Listen to me explain |
|---|
| 2nd row | Listen to me explain |
|---|
| 3rd row | Watch how I do it |
|---|
| 4th row | Listen to me explain |
|---|
| 5th row | Listen to me explain |
|---|
| Value | Count | Frequency (%) |
| listen | 496 | 10.8% |
| to | 496 | 10.8% |
| me | 496 | 10.8% |
| explain | 496 | 10.8% |
| watch | 370 | 8.1% |
| how | 370 | 8.1% |
| i | 370 | 8.1% |
| do | 370 | 8.1% |
| it | 370 | 8.1% |
| you | 186 | 4.1% |
| Other values (3) | 558 | 12.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| 3526 | 18.9% |
| t | 1732 | 9.3% |
| e | 1674 | 9.0% |
| o | 1608 | 8.6% |
| i | 1362 | 7.3% |
| a | 1238 | 6.6% |
| n | 992 | 5.3% |
| h | 926 | 5.0% |
| L | 496 | 2.7% |
| p | 496 | 2.7% |
| Other values (13) | 4578 | 24.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 13680 | 73.4% |
| Space Separator | 3526 | 18.9% |
| Uppercase Letter | 1422 | 7.6% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 1732 | 12.7% |
| e | 1674 | 12.2% |
| o | 1608 | 11.8% |
| i | 1362 | 10.0% |
| a | 1238 | 9.0% |
| n | 992 | 7.3% |
| h | 926 | 6.8% |
| p | 496 | 3.6% |
| l | 496 | 3.6% |
| x | 496 | 3.6% |
| Other values (8) | 2660 | 19.4% |
Uppercase Letter
| Value | Count | Frequency (%) |
| L | 496 | 34.9% |
| W | 370 | 26.0% |
| I | 370 | 26.0% |
| Y | 186 | 13.1% |
Space Separator
| Value | Count | Frequency (%) |
| 3526 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 15102 | 81.1% |
| Common | 3526 | 18.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 1732 | 11.5% |
| e | 1674 | 11.1% |
| o | 1608 | 10.6% |
| i | 1362 | 9.0% |
| a | 1238 | 8.2% |
| n | 992 | 6.6% |
| h | 926 | 6.1% |
| L | 496 | 3.3% |
| p | 496 | 3.3% |
| l | 496 | 3.3% |
| Other values (12) | 4082 | 27.0% |
Common
| Value | Count | Frequency (%) |
| 3526 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 18628 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 3526 | 18.9% |
| t | 1732 | 9.3% |
| e | 1674 | 9.0% |
| o | 1608 | 8.6% |
| i | 1362 | 7.3% |
| a | 1238 | 6.6% |
| n | 992 | 5.3% |
| h | 926 | 5.0% |
| L | 496 | 2.7% |
| p | 496 | 2.7% |
| Other values (13) | 4578 | 24.6% |
| Distinct | 3 |
|---|
| Distinct (%) | 0.3% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 8.3 KiB |
|---|
Length
| Max length | 64 |
|---|
| Median length | 14 |
|---|
| Mean length | 27.128327 |
|---|
| Min length | 5 |
|---|
Characters and Unicode
| Total characters | 28539 |
|---|
| Distinct characters | 23 |
|---|
| Distinct categories | 4 ? |
|---|
| Distinct scripts | 2 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Sample
| 1st row | Write a letter |
|---|
| 2nd row | Go back to the store, or send the faulty item to the head office |
|---|
| 3rd row | Go back to the store, or send the faulty item to the head office |
|---|
| 4th row | Go back to the store, or send the faulty item to the head office |
|---|
| 5th row | Write a letter |
|---|
| Value | Count | Frequency (%) |
| the | 1056 | 17.1% |
| to | 704 | 11.4% |
| phone | 421 | 6.8% |
| go | 352 | 5.7% |
| back | 352 | 5.7% |
| store | 352 | 5.7% |
| or | 352 | 5.7% |
| send | 352 | 5.7% |
| faulty | 352 | 5.7% |
| item | 352 | 5.7% |
| Other values (5) | 1541 | 24.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| 5134 | 18.0% |
| e | 4074 | 14.3% |
| t | 3653 | 12.8% |
| o | 2533 | 8.9% |
| h | 1829 | 6.4% |
| a | 1335 | 4.7% |
| r | 1262 | 4.4% |
| f | 1056 | 3.7% |
| i | 983 | 3.4% |
| n | 773 | 2.7% |
| Other values (13) | 5907 | 20.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 22001 | 77.1% |
| Space Separator | 5134 | 18.0% |
| Uppercase Letter | 1052 | 3.7% |
| Other Punctuation | 352 | 1.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 4074 | 18.5% |
| t | 3653 | 16.6% |
| o | 2533 | 11.5% |
| h | 1829 | 8.3% |
| a | 1335 | 6.1% |
| r | 1262 | 5.7% |
| f | 1056 | 4.8% |
| i | 983 | 4.5% |
| n | 773 | 3.5% |
| d | 704 | 3.2% |
| Other values (8) | 3799 | 17.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 421 | 40.0% |
| G | 352 | 33.5% |
| W | 279 | 26.5% |
Space Separator
| Value | Count | Frequency (%) |
| 5134 | 100.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 352 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 23053 | 80.8% |
| Common | 5486 | 19.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 4074 | 17.7% |
| t | 3653 | 15.8% |
| o | 2533 | 11.0% |
| h | 1829 | 7.9% |
| a | 1335 | 5.8% |
| r | 1262 | 5.5% |
| f | 1056 | 4.6% |
| i | 983 | 4.3% |
| n | 773 | 3.4% |
| d | 704 | 3.1% |
| Other values (11) | 4851 | 21.0% |
Common
| Value | Count | Frequency (%) |
| 5134 | 93.6% |
| , | 352 | 6.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 28539 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 5134 | 18.0% |
| e | 4074 | 14.3% |
| t | 3653 | 12.8% |
| o | 2533 | 8.9% |
| h | 1829 | 6.4% |
| a | 1335 | 4.7% |
| r | 1262 | 4.4% |
| f | 1056 | 3.7% |
| i | 983 | 3.4% |
| n | 773 | 2.7% |
| Other values (13) | 5907 | 20.7% |
| Distinct | 3 |
|---|
| Distinct (%) | 0.3% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 8.3 KiB |
|---|
Length
| Max length | 36 |
|---|
| Median length | 21 |
|---|
| Mean length | 27.35741445 |
|---|
| Min length | 20 |
|---|
Characters and Unicode
| Total characters | 28780 |
|---|
| Distinct characters | 20 |
|---|
| Distinct categories | 3 ? |
|---|
| Distinct scripts | 2 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Sample
| 1st row | Music or conversation |
|---|
| 2nd row | Physical activities or making things |
|---|
| 3rd row | Physical activities or making things |
|---|
| 4th row | Museums or galleries |
|---|
| 5th row | Music or conversation |
|---|
| Value | Count | Frequency (%) |
| or | 1052 | 25.9% |
| music | 460 | 11.3% |
| conversation | 460 | 11.3% |
| physical | 455 | 11.2% |
| activities | 455 | 11.2% |
| making | 455 | 11.2% |
| things | 455 | 11.2% |
| museums | 137 | 3.4% |
| galleries | 137 | 3.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 3787 | 13.2% |
| 3014 | 10.5% |
| s | 2696 | 9.4% |
| o | 1972 | 6.9% |
| a | 1962 | 6.8% |
| c | 1830 | 6.4% |
| n | 1830 | 6.4% |
| t | 1825 | 6.3% |
| r | 1649 | 5.7% |
| e | 1326 | 4.6% |
| Other values (10) | 6889 | 23.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 24714 | 85.9% |
| Space Separator | 3014 | 10.5% |
| Uppercase Letter | 1052 | 3.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 3787 | 15.3% |
| s | 2696 | 10.9% |
| o | 1972 | 8.0% |
| a | 1962 | 7.9% |
| c | 1830 | 7.4% |
| n | 1830 | 7.4% |
| t | 1825 | 7.4% |
| r | 1649 | 6.7% |
| e | 1326 | 5.4% |
| g | 1047 | 4.2% |
| Other values (7) | 4790 | 19.4% |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 597 | 56.7% |
| P | 455 | 43.3% |
Space Separator
| Value | Count | Frequency (%) |
| 3014 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 25766 | 89.5% |
| Common | 3014 | 10.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 3787 | 14.7% |
| s | 2696 | 10.5% |
| o | 1972 | 7.7% |
| a | 1962 | 7.6% |
| c | 1830 | 7.1% |
| n | 1830 | 7.1% |
| t | 1825 | 7.1% |
| r | 1649 | 6.4% |
| e | 1326 | 5.1% |
| g | 1047 | 4.1% |
| Other values (9) | 5842 | 22.7% |
Common
| Value | Count | Frequency (%) |
| 3014 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 28780 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 3787 | 13.2% |
| 3014 | 10.5% |
| s | 2696 | 9.4% |
| o | 1972 | 6.9% |
| a | 1962 | 6.8% |
| c | 1830 | 6.4% |
| n | 1830 | 6.4% |
| t | 1825 | 6.3% |
| r | 1649 | 5.7% |
| e | 1326 | 4.6% |
| Other values (10) | 6889 | 23.9% |
| Distinct | 3 |
|---|
| Distinct (%) | 0.3% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 8.3 KiB |
|---|
Length
| Max length | 23 |
|---|
| Median length | 15 |
|---|
| Mean length | 17.80038023 |
|---|
| Min length | 15 |
|---|
Characters and Unicode
| Total characters | 18726 |
|---|
| Distinct characters | 23 |
|---|
| Distinct categories | 4 ? |
|---|
| Distinct scripts | 2 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Sample
| 1st row | Discuss with shop staff |
|---|
| 2nd row | Look and decide |
|---|
| 3rd row | Look and decide |
|---|
| 4th row | Look and decide |
|---|
| 5th row | Look and decide |
|---|
| Value | Count | Frequency (%) |
| look | 639 | 16.3% |
| and | 639 | 16.3% |
| decide | 639 | 16.3% |
| try | 358 | 9.1% |
| on | 358 | 9.1% |
| handle | 358 | 9.1% |
| or | 358 | 9.1% |
| test | 358 | 9.1% |
| discuss | 55 | 1.4% |
| with | 55 | 1.4% |
| Other values (2) | 110 | 2.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| 2875 | 15.4% |
| d | 2275 | 12.1% |
| o | 2049 | 10.9% |
| e | 1994 | 10.6% |
| n | 1355 | 7.2% |
| a | 1052 | 5.6% |
| t | 826 | 4.4% |
| i | 749 | 4.0% |
| r | 716 | 3.8% |
| c | 694 | 3.7% |
| Other values (13) | 4141 | 22.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 14441 | 77.1% |
| Space Separator | 2875 | 15.4% |
| Uppercase Letter | 1052 | 5.6% |
| Other Punctuation | 358 | 1.9% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| d | 2275 | 15.8% |
| o | 2049 | 14.2% |
| e | 1994 | 13.8% |
| n | 1355 | 9.4% |
| a | 1052 | 7.3% |
| t | 826 | 5.7% |
| i | 749 | 5.2% |
| r | 716 | 5.0% |
| c | 694 | 4.8% |
| k | 639 | 4.4% |
| Other values (8) | 2092 | 14.5% |
Uppercase Letter
| Value | Count | Frequency (%) |
| L | 639 | 60.7% |
| T | 358 | 34.0% |
| D | 55 | 5.2% |
Space Separator
| Value | Count | Frequency (%) |
| 2875 | 100.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 358 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 15493 | 82.7% |
| Common | 3233 | 17.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| d | 2275 | 14.7% |
| o | 2049 | 13.2% |
| e | 1994 | 12.9% |
| n | 1355 | 8.7% |
| a | 1052 | 6.8% |
| t | 826 | 5.3% |
| i | 749 | 4.8% |
| r | 716 | 4.6% |
| c | 694 | 4.5% |
| L | 639 | 4.1% |
| Other values (11) | 3144 | 20.3% |
Common
| Value | Count | Frequency (%) |
| 2875 | 88.9% |
| , | 358 | 11.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 18726 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2875 | 15.4% |
| d | 2275 | 12.1% |
| o | 2049 | 10.9% |
| e | 1994 | 10.6% |
| n | 1355 | 7.2% |
| a | 1052 | 5.6% |
| t | 826 | 4.4% |
| i | 749 | 4.0% |
| r | 716 | 3.8% |
| c | 694 | 3.7% |
| Other values (13) | 4141 | 22.1% |
| Distinct | 3 |
|---|
| Distinct (%) | 0.3% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 8.3 KiB |
|---|
Length
| Max length | 25 |
|---|
| Median length | 25 |
|---|
| Mean length | 22.87547529 |
|---|
| Min length | 18 |
|---|
Characters and Unicode
| Total characters | 24065 |
|---|
| Distinct characters | 21 |
|---|
| Distinct categories | 3 ? |
|---|
| Distinct scripts | 2 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Sample
| 1st row | Read the brochures |
|---|
| 2nd row | Listen to recommendations |
|---|
| 3rd row | Listen to recommendations |
|---|
| 4th row | Listen to recommendations |
|---|
| 5th row | Listen to recommendations |
|---|
| Value | Count | Frequency (%) |
| listen | 539 | 17.1% |
| to | 539 | 17.1% |
| recommendations | 539 | 17.1% |
| the | 513 | 16.3% |
| imagine | 339 | 10.7% |
| experience | 339 | 10.7% |
| read | 174 | 5.5% |
| brochures | 174 | 5.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 4173 | 17.3% |
| n | 2295 | 9.5% |
| t | 2130 | 8.9% |
| 2104 | 8.7% |
| o | 1791 | 7.4% |
| i | 1756 | 7.3% |
| m | 1417 | 5.9% |
| s | 1252 | 5.2% |
| r | 1226 | 5.1% |
| c | 1052 | 4.4% |
| Other values (11) | 4869 | 20.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 20909 | 86.9% |
| Space Separator | 2104 | 8.7% |
| Uppercase Letter | 1052 | 4.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 4173 | 20.0% |
| n | 2295 | 11.0% |
| t | 2130 | 10.2% |
| o | 1791 | 8.6% |
| i | 1756 | 8.4% |
| m | 1417 | 6.8% |
| s | 1252 | 6.0% |
| r | 1226 | 5.9% |
| c | 1052 | 5.0% |
| a | 1052 | 5.0% |
| Other values (7) | 2765 | 13.2% |
Uppercase Letter
| Value | Count | Frequency (%) |
| L | 539 | 51.2% |
| I | 339 | 32.2% |
| R | 174 | 16.5% |
Space Separator
| Value | Count | Frequency (%) |
| 2104 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 21961 | 91.3% |
| Common | 2104 | 8.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 4173 | 19.0% |
| n | 2295 | 10.5% |
| t | 2130 | 9.7% |
| o | 1791 | 8.2% |
| i | 1756 | 8.0% |
| m | 1417 | 6.5% |
| s | 1252 | 5.7% |
| r | 1226 | 5.6% |
| c | 1052 | 4.8% |
| a | 1052 | 4.8% |
| Other values (10) | 3817 | 17.4% |
Common
| Value | Count | Frequency (%) |
| 2104 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 24065 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 4173 | 17.3% |
| n | 2295 | 9.5% |
| t | 2130 | 8.9% |
| 2104 | 8.7% |
| o | 1791 | 7.4% |
| i | 1756 | 7.3% |
| m | 1417 | 5.9% |
| s | 1252 | 5.2% |
| r | 1226 | 5.1% |
| c | 1052 | 4.4% |
| Other values (11) | 4869 | 20.2% |
| Distinct | 3 |
|---|
| Distinct (%) | 0.3% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 8.3 KiB |
|---|
Length
| Max length | 25 |
|---|
| Median length | 16 |
|---|
| Mean length | 19.56368821 |
|---|
| Min length | 16 |
|---|
Characters and Unicode
| Total characters | 20581 |
|---|
| Distinct characters | 21 |
|---|
| Distinct categories | 4 ? |
|---|
| Distinct scripts | 2 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Sample
| 1st row | Read the reviews |
|---|
| 2nd row | Test-drive what you fancy |
|---|
| 3rd row | Read the reviews |
|---|
| 4th row | Discuss with friends |
|---|
| 5th row | Test-drive what you fancy |
|---|
| Value | Count | Frequency (%) |
| read | 531 | 15.2% |
| the | 531 | 15.2% |
| reviews | 531 | 15.2% |
| test-drive | 333 | 9.5% |
| what | 333 | 9.5% |
| you | 333 | 9.5% |
| fancy | 333 | 9.5% |
| discuss | 188 | 5.4% |
| with | 188 | 5.4% |
| friends | 188 | 5.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 2978 | 14.5% |
| 2437 | 11.8% |
| s | 1616 | 7.9% |
| i | 1428 | 6.9% |
| t | 1385 | 6.7% |
| a | 1197 | 5.8% |
| w | 1052 | 5.1% |
| d | 1052 | 5.1% |
| h | 1052 | 5.1% |
| r | 1052 | 5.1% |
| Other values (11) | 5332 | 25.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 16759 | 81.4% |
| Space Separator | 2437 | 11.8% |
| Uppercase Letter | 1052 | 5.1% |
| Dash Punctuation | 333 | 1.6% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 2978 | 17.8% |
| s | 1616 | 9.6% |
| i | 1428 | 8.5% |
| t | 1385 | 8.3% |
| a | 1197 | 7.1% |
| w | 1052 | 6.3% |
| d | 1052 | 6.3% |
| h | 1052 | 6.3% |
| r | 1052 | 6.3% |
| v | 864 | 5.2% |
| Other values (6) | 3083 | 18.4% |
Uppercase Letter
| Value | Count | Frequency (%) |
| R | 531 | 50.5% |
| T | 333 | 31.7% |
| D | 188 | 17.9% |
Space Separator
| Value | Count | Frequency (%) |
| 2437 | 100.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 333 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 17811 | 86.5% |
| Common | 2770 | 13.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 2978 | 16.7% |
| s | 1616 | 9.1% |
| i | 1428 | 8.0% |
| t | 1385 | 7.8% |
| a | 1197 | 6.7% |
| w | 1052 | 5.9% |
| d | 1052 | 5.9% |
| h | 1052 | 5.9% |
| r | 1052 | 5.9% |
| v | 864 | 4.9% |
| Other values (9) | 4135 | 23.2% |
Common
| Value | Count | Frequency (%) |
| 2437 | 88.0% |
| - | 333 | 12.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 20581 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 2978 | 14.5% |
| 2437 | 11.8% |
| s | 1616 | 7.9% |
| i | 1428 | 6.9% |
| t | 1385 | 6.7% |
| a | 1197 | 5.8% |
| w | 1052 | 5.1% |
| d | 1052 | 5.1% |
| h | 1052 | 5.1% |
| r | 1052 | 5.1% |
| Other values (11) | 5332 | 25.9% |
| Distinct | 3 |
|---|
| Distinct (%) | 0.3% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 8.3 KiB |
|---|
Length
| Max length | 65 |
|---|
| Median length | 64 |
|---|
| Mean length | 53.27851711 |
|---|
| Min length | 33 |
|---|
Characters and Unicode
| Total characters | 56049 |
|---|
| Distinct characters | 24 |
|---|
| Distinct categories | 3 ? |
|---|
| Distinct scripts | 2 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Sample
| 1st row | I talk through with the teacher exactly what I am supposed to do |
|---|
| 2nd row | I watch what the teacher is doing |
|---|
| 3rd row | I like to give it a try and work it out as I go along by doing it |
|---|
| 4th row | I talk through with the teacher exactly what I am supposed to do |
|---|
| 5th row | I like to give it a try and work it out as I go along by doing it |
|---|
| Value | Count | Frequency (%) |
| i | 1724 | 12.4% |
| it | 1503 | 10.8% |
| doing | 881 | 6.3% |
| to | 672 | 4.8% |
| teacher | 551 | 4.0% |
| the | 551 | 4.0% |
| what | 551 | 4.0% |
| go | 501 | 3.6% |
| like | 501 | 3.6% |
| along | 501 | 3.6% |
| Other values (17) | 5965 | 42.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| 12849 | 22.9% |
| t | 5894 | 10.5% |
| o | 4070 | 7.3% |
| a | 3999 | 7.1% |
| i | 3937 | 7.0% |
| e | 2997 | 5.3% |
| g | 2555 | 4.6% |
| h | 2546 | 4.5% |
| n | 1883 | 3.4% |
| I | 1724 | 3.1% |
| Other values (14) | 13595 | 24.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 41476 | 74.0% |
| Space Separator | 12849 | 22.9% |
| Uppercase Letter | 1724 | 3.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 5894 | 14.2% |
| o | 4070 | 9.8% |
| a | 3999 | 9.6% |
| i | 3937 | 9.5% |
| e | 2997 | 7.2% |
| g | 2555 | 6.2% |
| h | 2546 | 6.1% |
| n | 1883 | 4.5% |
| r | 1724 | 4.2% |
| d | 1724 | 4.2% |
| Other values (12) | 10147 | 24.5% |
Space Separator
| Value | Count | Frequency (%) |
| 12849 | 100.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 1724 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 43200 | 77.1% |
| Common | 12849 | 22.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 5894 | 13.6% |
| o | 4070 | 9.4% |
| a | 3999 | 9.3% |
| i | 3937 | 9.1% |
| e | 2997 | 6.9% |
| g | 2555 | 5.9% |
| h | 2546 | 5.9% |
| n | 1883 | 4.4% |
| I | 1724 | 4.0% |
| r | 1724 | 4.0% |
| Other values (13) | 11871 | 27.5% |
Common
| Value | Count | Frequency (%) |
| 12849 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 56049 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 12849 | 22.9% |
| t | 5894 | 10.5% |
| o | 4070 | 7.3% |
| a | 3999 | 7.1% |
| i | 3937 | 7.0% |
| e | 2997 | 5.3% |
| g | 2555 | 4.6% |
| h | 2546 | 4.5% |
| n | 1883 | 3.4% |
| I | 1724 | 3.1% |
| Other values (14) | 13595 | 24.3% |
| Distinct | 3 |
|---|
| Distinct (%) | 0.3% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 8.3 KiB |
|---|
Length
| Max length | 40 |
|---|
| Median length | 38 |
|---|
| Mean length | 38.61121673 |
|---|
| Min length | 37 |
|---|
Characters and Unicode
| Total characters | 40619 |
|---|
| Distinct characters | 21 |
|---|
| Distinct categories | 3 ? |
|---|
| Distinct scripts | 2 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Sample
| 1st row | I imagine what the food will look like |
|---|
| 2nd row | I imagine what the food will look like |
|---|
| 3rd row | I imagine what the food will look like |
|---|
| 4th row | I imagine what the food will taste like |
|---|
| 5th row | I imagine what the food will taste like |
|---|
| Value | Count | Frequency (%) |
| i | 1052 | 12.5% |
| the | 1052 | 12.5% |
| imagine | 673 | 8.0% |
| what | 673 | 8.0% |
| food | 673 | 8.0% |
| will | 673 | 8.0% |
| like | 673 | 8.0% |
| taste | 511 | 6.1% |
| talk | 379 | 4.5% |
| through | 379 | 4.5% |
| Other values (5) | 1678 | 19.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| 7875 | 19.4% |
| t | 3884 | 9.6% |
| i | 3450 | 8.5% |
| e | 3288 | 8.1% |
| h | 2862 | 7.0% |
| o | 2807 | 6.9% |
| a | 2615 | 6.4% |
| l | 2560 | 6.3% |
| n | 1431 | 3.5% |
| w | 1346 | 3.3% |
| Other values (11) | 8501 | 20.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 31692 | 78.0% |
| Space Separator | 7875 | 19.4% |
| Uppercase Letter | 1052 | 2.6% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 3884 | 12.3% |
| i | 3450 | 10.9% |
| e | 3288 | 10.4% |
| h | 2862 | 9.0% |
| o | 2807 | 8.9% |
| a | 2615 | 8.3% |
| l | 2560 | 8.1% |
| n | 1431 | 4.5% |
| w | 1346 | 4.2% |
| k | 1214 | 3.8% |
| Other values (9) | 6235 | 19.7% |
Space Separator
| Value | Count | Frequency (%) |
| 7875 | 100.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 1052 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 32744 | 80.6% |
| Common | 7875 | 19.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 3884 | 11.9% |
| i | 3450 | 10.5% |
| e | 3288 | 10.0% |
| h | 2862 | 8.7% |
| o | 2807 | 8.6% |
| a | 2615 | 8.0% |
| l | 2560 | 7.8% |
| n | 1431 | 4.4% |
| w | 1346 | 4.1% |
| k | 1214 | 3.7% |
| Other values (10) | 7287 | 22.3% |
Common
| Value | Count | Frequency (%) |
| 7875 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 40619 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 7875 | 19.4% |
| t | 3884 | 9.6% |
| i | 3450 | 8.5% |
| e | 3288 | 8.1% |
| h | 2862 | 7.0% |
| o | 2807 | 6.9% |
| a | 2615 | 6.4% |
| l | 2560 | 6.3% |
| n | 1431 | 3.5% |
| w | 1346 | 3.3% |
| Other values (11) | 8501 | 20.9% |
| Distinct | 3 |
|---|
| Distinct (%) | 0.3% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 8.3 KiB |
|---|
Length
| Max length | 51 |
|---|
| Median length | 36 |
|---|
| Mean length | 42.38022814 |
|---|
| Min length | 29 |
|---|
Characters and Unicode
| Total characters | 44584 |
|---|
| Distinct characters | 23 |
|---|
| Distinct categories | 5 ? |
|---|
| Distinct scripts | 2 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Sample
| 1st row | I sing along to the lyrics (in my head or out loud) |
|---|
| 2nd row | I listen to the lyrics and the beats |
|---|
| 3rd row | I sing along to the lyrics (in my head or out loud) |
|---|
| 4th row | I listen to the lyrics and the beats |
|---|
| 5th row | I sing along to the lyrics (in my head or out loud) |
|---|
| Value | Count | Frequency (%) |
| the | 1416 | 13.7% |
| i | 1052 | 10.2% |
| to | 888 | 8.6% |
| lyrics | 888 | 8.6% |
| in | 688 | 6.6% |
| sing | 524 | 5.1% |
| loud | 524 | 5.1% |
| out | 524 | 5.1% |
| or | 524 | 5.1% |
| head | 524 | 5.1% |
| Other values (9) | 2796 | 27.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 9296 | 20.9% |
| t | 3884 | 8.7% |
| o | 3148 | 7.1% |
| e | 2996 | 6.7% |
| i | 2956 | 6.6% |
| n | 2464 | 5.5% |
| s | 2304 | 5.2% |
| l | 2300 | 5.2% |
| h | 2104 | 4.7% |
| a | 1776 | 4.0% |
| Other values (13) | 11356 | 25.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 33188 | 74.4% |
| Space Separator | 9296 | 20.9% |
| Uppercase Letter | 1052 | 2.4% |
| Open Punctuation | 524 | 1.2% |
| Close Punctuation | 524 | 1.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 3884 | 11.7% |
| o | 3148 | 9.5% |
| e | 2996 | 9.0% |
| i | 2956 | 8.9% |
| n | 2464 | 7.4% |
| s | 2304 | 6.9% |
| l | 2300 | 6.9% |
| h | 2104 | 6.3% |
| a | 1776 | 5.4% |
| d | 1412 | 4.3% |
| Other values (9) | 7844 | 23.6% |
Space Separator
| Value | Count | Frequency (%) |
| 9296 | 100.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 1052 | 100.0% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 524 | 100.0% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 524 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 34240 | 76.8% |
| Common | 10344 | 23.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 3884 | 11.3% |
| o | 3148 | 9.2% |
| e | 2996 | 8.8% |
| i | 2956 | 8.6% |
| n | 2464 | 7.2% |
| s | 2304 | 6.7% |
| l | 2300 | 6.7% |
| h | 2104 | 6.1% |
| a | 1776 | 5.2% |
| d | 1412 | 4.1% |
| Other values (10) | 8896 | 26.0% |
Common
| Value | Count | Frequency (%) |
| 9296 | 89.9% |
| ( | 524 | 5.1% |
| ) | 524 | 5.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 44584 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 9296 | 20.9% |
| t | 3884 | 8.7% |
| o | 3148 | 7.1% |
| e | 2996 | 6.7% |
| i | 2956 | 6.6% |
| n | 2464 | 5.5% |
| s | 2304 | 5.2% |
| l | 2300 | 5.2% |
| h | 2104 | 4.7% |
| a | 1776 | 4.0% |
| Other values (13) | 11356 | 25.5% |
| Distinct | 3 |
|---|
| Distinct (%) | 0.3% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 8.3 KiB |
|---|
Length
| Max length | 74 |
|---|
| Median length | 53 |
|---|
| Mean length | 52.83365019 |
|---|
| Min length | 45 |
|---|
Characters and Unicode
| Total characters | 55581 |
|---|
| Distinct characters | 26 |
|---|
| Distinct categories | 4 ? |
|---|
| Distinct scripts | 2 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Sample
| 1st row | Focus on the words or pictures in front of me |
|---|
| 2nd row | Focus on the words or pictures in front of me |
|---|
| 3rd row | Discuss the problem and possible solutions in my head |
|---|
| 4th row | Focus on the words or pictures in front of me |
|---|
| 5th row | Discuss the problem and possible solutions in my head |
|---|
| Value | Count | Frequency (%) |
| the | 863 | 8.0% |
| in | 863 | 8.0% |
| and | 723 | 6.7% |
| focus | 518 | 4.8% |
| on | 518 | 4.8% |
| words | 518 | 4.8% |
| or | 518 | 4.8% |
| pictures | 518 | 4.8% |
| front | 518 | 4.8% |
| of | 518 | 4.8% |
| Other values (18) | 4667 | 43.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| 9690 | 17.4% |
| o | 5244 | 9.4% |
| s | 4536 | 8.2% |
| e | 4068 | 7.3% |
| n | 3912 | 7.0% |
| t | 3189 | 5.7% |
| i | 3172 | 5.7% |
| r | 2795 | 5.0% |
| d | 2342 | 4.2% |
| u | 2293 | 4.1% |
| Other values (16) | 14340 | 25.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 44650 | 80.3% |
| Space Separator | 9690 | 17.4% |
| Uppercase Letter | 1052 | 1.9% |
| Other Punctuation | 189 | 0.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 5244 | 11.7% |
| s | 4536 | 10.2% |
| e | 4068 | 9.1% |
| n | 3912 | 8.8% |
| t | 3189 | 7.1% |
| i | 3172 | 7.1% |
| r | 2795 | 6.3% |
| d | 2342 | 5.2% |
| u | 2293 | 5.1% |
| l | 1791 | 4.0% |
| Other values (11) | 11308 | 25.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| F | 518 | 49.2% |
| D | 345 | 32.8% |
| M | 189 | 18.0% |
Space Separator
| Value | Count | Frequency (%) |
| 9690 | 100.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 189 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 45702 | 82.2% |
| Common | 9879 | 17.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 5244 | 11.5% |
| s | 4536 | 9.9% |
| e | 4068 | 8.9% |
| n | 3912 | 8.6% |
| t | 3189 | 7.0% |
| i | 3172 | 6.9% |
| r | 2795 | 6.1% |
| d | 2342 | 5.1% |
| u | 2293 | 5.0% |
| l | 1791 | 3.9% |
| Other values (14) | 12360 | 27.0% |
Common
| Value | Count | Frequency (%) |
| 9690 | 98.1% |
| , | 189 | 1.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 55581 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 9690 | 17.4% |
| o | 5244 | 9.4% |
| s | 4536 | 8.2% |
| e | 4068 | 7.3% |
| n | 3912 | 7.0% |
| t | 3189 | 5.7% |
| i | 3172 | 5.7% |
| r | 2795 | 5.0% |
| d | 2342 | 4.2% |
| u | 2293 | 4.1% |
| Other values (16) | 14340 | 25.8% |
| Distinct | 3 |
|---|
| Distinct (%) | 0.3% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 8.3 KiB |
|---|
Length
| Max length | 62 |
|---|
| Median length | 60 |
|---|
| Mean length | 52.36882129 |
|---|
| Min length | 40 |
|---|
Characters and Unicode
| Total characters | 55092 |
|---|
| Distinct characters | 25 |
|---|
| Distinct categories | 3 ? |
|---|
| Distinct scripts | 2 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Sample
| 1st row | Saying them aloud or repeating words and key points in my head |
|---|
| 2nd row | Writing notes or keeping printed details |
|---|
| 3rd row | Writing notes or keeping printed details |
|---|
| 4th row | Writing notes or keeping printed details |
|---|
| 5th row | Writing notes or keeping printed details |
|---|
| Value | Count | Frequency (%) |
| or | 1052 | 11.2% |
| and | 623 | 6.7% |
| writing | 429 | 4.6% |
| keeping | 429 | 4.6% |
| printed | 429 | 4.6% |
| details | 429 | 4.6% |
| notes | 429 | 4.6% |
| imagining | 347 | 3.7% |
| done | 347 | 3.7% |
| being | 347 | 3.7% |
| Other values (15) | 4495 | 48.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 8304 | 15.1% |
| i | 6719 | 12.2% |
| n | 5525 | 10.0% |
| e | 4566 | 8.3% |
| t | 4279 | 7.8% |
| a | 3197 | 5.8% |
| g | 3145 | 5.7% |
| o | 3003 | 5.5% |
| r | 2809 | 5.1% |
| d | 2656 | 4.8% |
| Other values (15) | 10889 | 19.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 45736 | 83.0% |
| Space Separator | 8304 | 15.1% |
| Uppercase Letter | 1052 | 1.9% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 6719 | 14.7% |
| n | 5525 | 12.1% |
| e | 4566 | 10.0% |
| t | 4279 | 9.4% |
| a | 3197 | 7.0% |
| g | 3145 | 6.9% |
| o | 3003 | 6.6% |
| r | 2809 | 6.1% |
| d | 2656 | 5.8% |
| p | 1757 | 3.8% |
| Other values (11) | 8080 | 17.7% |
Uppercase Letter
| Value | Count | Frequency (%) |
| W | 429 | 40.8% |
| D | 347 | 33.0% |
| S | 276 | 26.2% |
Space Separator
| Value | Count | Frequency (%) |
| 8304 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 46788 | 84.9% |
| Common | 8304 | 15.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 6719 | 14.4% |
| n | 5525 | 11.8% |
| e | 4566 | 9.8% |
| t | 4279 | 9.1% |
| a | 3197 | 6.8% |
| g | 3145 | 6.7% |
| o | 3003 | 6.4% |
| r | 2809 | 6.0% |
| d | 2656 | 5.7% |
| p | 1757 | 3.8% |
| Other values (14) | 9132 | 19.5% |
Common
| Value | Count | Frequency (%) |
| 8304 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 55092 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 8304 | 15.1% |
| i | 6719 | 12.2% |
| n | 5525 | 10.0% |
| e | 4566 | 8.3% |
| t | 4279 | 7.8% |
| a | 3197 | 5.8% |
| g | 3145 | 5.7% |
| o | 3003 | 5.5% |
| r | 2809 | 5.1% |
| d | 2656 | 4.8% |
| Other values (15) | 10889 | 19.8% |
| Distinct | 3 |
|---|
| Distinct (%) | 0.3% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 8.3 KiB |
|---|
Length
| Max length | 20 |
|---|
| Median length | 15 |
|---|
| Mean length | 17.2338403 |
|---|
| Min length | 15 |
|---|
Characters and Unicode
| Total characters | 18130 |
|---|
| Distinct characters | 16 |
|---|
| Distinct categories | 3 ? |
|---|
| Distinct scripts | 2 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Sample
| 1st row | Looking at something |
|---|
| 2nd row | Looking at something |
|---|
| 3rd row | Doing something |
|---|
| 4th row | Being spoken to |
|---|
| 5th row | Doing something |
|---|
| Value | Count | Frequency (%) |
| something | 932 | 34.6% |
| looking | 470 | 17.4% |
| at | 470 | 17.4% |
| doing | 462 | 17.1% |
| being | 120 | 4.5% |
| spoken | 120 | 4.5% |
| to | 120 | 4.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 2574 | 14.2% |
| n | 2104 | 11.6% |
| i | 1984 | 10.9% |
| g | 1984 | 10.9% |
| 1642 | 9.1% |
| t | 1522 | 8.4% |
| e | 1172 | 6.5% |
| s | 1052 | 5.8% |
| m | 932 | 5.1% |
| h | 932 | 5.1% |
| Other values (6) | 2232 | 12.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 15436 | 85.1% |
| Space Separator | 1642 | 9.1% |
| Uppercase Letter | 1052 | 5.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 2574 | 16.7% |
| n | 2104 | 13.6% |
| i | 1984 | 12.9% |
| g | 1984 | 12.9% |
| t | 1522 | 9.9% |
| e | 1172 | 7.6% |
| s | 1052 | 6.8% |
| m | 932 | 6.0% |
| h | 932 | 6.0% |
| k | 590 | 3.8% |
| Other values (2) | 590 | 3.8% |
Uppercase Letter
| Value | Count | Frequency (%) |
| L | 470 | 44.7% |
| D | 462 | 43.9% |
| B | 120 | 11.4% |
Space Separator
| Value | Count | Frequency (%) |
| 1642 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 16488 | 90.9% |
| Common | 1642 | 9.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 2574 | 15.6% |
| n | 2104 | 12.8% |
| i | 1984 | 12.0% |
| g | 1984 | 12.0% |
| t | 1522 | 9.2% |
| e | 1172 | 7.1% |
| s | 1052 | 6.4% |
| m | 932 | 5.7% |
| h | 932 | 5.7% |
| k | 590 | 3.6% |
| Other values (5) | 1642 | 10.0% |
Common
| Value | Count | Frequency (%) |
| 1642 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 18130 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 2574 | 14.2% |
| n | 2104 | 11.6% |
| i | 1984 | 10.9% |
| g | 1984 | 10.9% |
| 1642 | 9.1% |
| t | 1522 | 8.4% |
| e | 1172 | 6.5% |
| s | 1052 | 5.8% |
| m | 932 | 5.1% |
| h | 932 | 5.1% |
| Other values (6) | 2232 | 12.3% |
| Distinct | 3 |
|---|
| Distinct (%) | 0.3% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 8.3 KiB |
|---|
Length
| Max length | 50 |
|---|
| Median length | 41 |
|---|
| Mean length | 40.64828897 |
|---|
| Min length | 34 |
|---|
Characters and Unicode
| Total characters | 42762 |
|---|
| Distinct characters | 26 |
|---|
| Distinct categories | 4 ? |
|---|
| Distinct scripts | 2 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Sample
| 1st row | Visualize the worst case scenarios |
|---|
| 2nd row | Visualize the worst case scenarios |
|---|
| 3rd row | Visualize the worst case scenarios |
|---|
| 4th row | Talk over in my head what worries me most |
|---|
| 5th row | Talk over in my head what worries me most |
|---|
| Value | Count | Frequency (%) |
| talk | 478 | 6.1% |
| over | 478 | 6.1% |
| in | 478 | 6.1% |
| my | 478 | 6.1% |
| head | 478 | 6.1% |
| what | 478 | 6.1% |
| worries | 478 | 6.1% |
| me | 478 | 6.1% |
| most | 478 | 6.1% |
| scenarios | 346 | 4.4% |
| Other values (12) | 3208 | 40.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| 6804 | 15.9% |
| e | 3752 | 8.8% |
| a | 3384 | 7.9% |
| s | 3370 | 7.9% |
| o | 2810 | 6.6% |
| t | 2788 | 6.5% |
| i | 2678 | 6.3% |
| r | 2354 | 5.5% |
| n | 1964 | 4.6% |
| l | 1736 | 4.1% |
| Other values (16) | 11122 | 26.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 34450 | 80.6% |
| Space Separator | 6804 | 15.9% |
| Uppercase Letter | 1052 | 2.5% |
| Other Punctuation | 456 | 1.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 3752 | 10.9% |
| a | 3384 | 9.8% |
| s | 3370 | 9.8% |
| o | 2810 | 8.2% |
| t | 2788 | 8.1% |
| i | 2678 | 7.8% |
| r | 2354 | 6.8% |
| n | 1964 | 5.7% |
| l | 1736 | 5.0% |
| m | 1662 | 4.8% |
| Other values (10) | 7952 | 23.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 478 | 45.4% |
| V | 346 | 32.9% |
| C | 228 | 21.7% |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 228 | 50.0% |
| , | 228 | 50.0% |
Space Separator
| Value | Count | Frequency (%) |
| 6804 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 35502 | 83.0% |
| Common | 7260 | 17.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 3752 | 10.6% |
| a | 3384 | 9.5% |
| s | 3370 | 9.5% |
| o | 2810 | 7.9% |
| t | 2788 | 7.9% |
| i | 2678 | 7.5% |
| r | 2354 | 6.6% |
| n | 1964 | 5.5% |
| l | 1736 | 4.9% |
| m | 1662 | 4.7% |
| Other values (13) | 9004 | 25.4% |
Common
| Value | Count | Frequency (%) |
| 6804 | 93.7% |
| ' | 228 | 3.1% |
| , | 228 | 3.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 42762 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 6804 | 15.9% |
| e | 3752 | 8.8% |
| a | 3384 | 7.9% |
| s | 3370 | 7.9% |
| o | 2810 | 6.6% |
| t | 2788 | 6.5% |
| i | 2678 | 6.3% |
| r | 2354 | 5.5% |
| n | 1964 | 4.6% |
| l | 1736 | 4.1% |
| Other values (16) | 11122 | 26.0% |
| Distinct | 4 |
|---|
| Distinct (%) | 0.4% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 8.3 KiB |
|---|
Length
| Max length | 19 |
|---|
| Median length | 16 |
|---|
| Mean length | 16.93821293 |
|---|
| Min length | 13 |
|---|
Characters and Unicode
| Total characters | 17819 |
|---|
| Distinct characters | 14 |
|---|
| Distinct categories | 3 ? |
|---|
| Distinct scripts | 2 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Sample
| 1st row | What they say to me |
|---|
| 2nd row | How they look |
|---|
| 3rd row | Hoe they make me |
|---|
| 4th row | What they say to me |
|---|
| 5th row | What they say to me |
|---|
| Value | Count | Frequency (%) |
| they | 1052 | 23.2% |
| me | 944 | 20.8% |
| how | 605 | 13.3% |
| make | 507 | 11.2% |
| what | 437 | 9.6% |
| say | 437 | 9.6% |
| to | 437 | 9.6% |
| look | 108 | 2.4% |
| hoe | 10 | 0.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| 3485 | 19.6% |
| e | 2513 | 14.1% |
| t | 1926 | 10.8% |
| h | 1489 | 8.4% |
| y | 1489 | 8.4% |
| m | 1451 | 8.1% |
| a | 1381 | 7.8% |
| o | 1268 | 7.1% |
| H | 615 | 3.5% |
| k | 615 | 3.5% |
| Other values (4) | 1587 | 8.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 13282 | 74.5% |
| Space Separator | 3485 | 19.6% |
| Uppercase Letter | 1052 | 5.9% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 2513 | 18.9% |
| t | 1926 | 14.5% |
| h | 1489 | 11.2% |
| y | 1489 | 11.2% |
| m | 1451 | 10.9% |
| a | 1381 | 10.4% |
| o | 1268 | 9.5% |
| k | 615 | 4.6% |
| w | 605 | 4.6% |
| s | 437 | 3.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| H | 615 | 58.5% |
| W | 437 | 41.5% |
Space Separator
| Value | Count | Frequency (%) |
| 3485 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 14334 | 80.4% |
| Common | 3485 | 19.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 2513 | 17.5% |
| t | 1926 | 13.4% |
| h | 1489 | 10.4% |
| y | 1489 | 10.4% |
| m | 1451 | 10.1% |
| a | 1381 | 9.6% |
| o | 1268 | 8.8% |
| H | 615 | 4.3% |
| k | 615 | 4.3% |
| w | 605 | 4.2% |
| Other values (3) | 982 | 6.9% |
Common
| Value | Count | Frequency (%) |
| 3485 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 17819 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 3485 | 19.6% |
| e | 2513 | 14.1% |
| t | 1926 | 10.8% |
| h | 1489 | 8.4% |
| y | 1489 | 8.4% |
| m | 1451 | 8.1% |
| a | 1381 | 7.8% |
| o | 1268 | 7.1% |
| H | 615 | 3.5% |
| k | 615 | 3.5% |
| Other values (4) | 1587 | 8.9% |
| Distinct | 4 |
|---|
| Distinct (%) | 0.4% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 8.3 KiB |
|---|
Length
| Max length | 52 |
|---|
| Median length | 51 |
|---|
| Mean length | 42.26330798 |
|---|
| Min length | 28 |
|---|
Characters and Unicode
| Total characters | 44461 |
|---|
| Distinct characters | 26 |
|---|
| Distinct categories | 6 ? |
|---|
| Distinct scripts | 2 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Sample
| 1st row | Write lots of revision notes (using lots of colors!) |
|---|
| 2nd row | I talk over my notes, to myself or to other people |
|---|
| 3rd row | I talk over my notes, to myself or to other people |
|---|
| 4th row | Write lots of revision notes (using lots of colors!) |
|---|
| 5th row | Write lots of revision notes (using lots of colors!) |
|---|
| Value | Count | Frequency (%) |
| to | 974 | 11.1% |
| notes | 879 | 10.0% |
| or | 660 | 7.5% |
| i | 487 | 5.6% |
| talk | 487 | 5.6% |
| over | 487 | 5.6% |
| my | 487 | 5.6% |
| myself | 487 | 5.6% |
| other | 487 | 5.6% |
| people | 487 | 5.6% |
| Other values (12) | 2831 | 32.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| 7701 | 17.3% |
| o | 5548 | 12.5% |
| e | 5136 | 11.6% |
| t | 4316 | 9.7% |
| r | 2777 | 6.2% |
| s | 2189 | 4.9% |
| l | 2052 | 4.6% |
| n | 1976 | 4.4% |
| m | 1839 | 4.1% |
| i | 1708 | 3.8% |
| Other values (16) | 9219 | 20.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 35182 | 79.1% |
| Space Separator | 7701 | 17.3% |
| Uppercase Letter | 1052 | 2.4% |
| Other Punctuation | 500 | 1.1% |
| Open Punctuation | 13 | < 0.1% |
| Close Punctuation | 13 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 5548 | 15.8% |
| e | 5136 | 14.6% |
| t | 4316 | 12.3% |
| r | 2777 | 7.9% |
| s | 2189 | 6.2% |
| l | 2052 | 5.8% |
| n | 1976 | 5.6% |
| m | 1839 | 5.2% |
| i | 1708 | 4.9% |
| a | 1179 | 3.4% |
| Other values (9) | 6462 | 18.4% |
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 660 | 62.7% |
| W | 392 | 37.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 487 | 97.4% |
| ! | 13 | 2.6% |
Space Separator
| Value | Count | Frequency (%) |
| 7701 | 100.0% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 13 | 100.0% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 13 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 36234 | 81.5% |
| Common | 8227 | 18.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 5548 | 15.3% |
| e | 5136 | 14.2% |
| t | 4316 | 11.9% |
| r | 2777 | 7.7% |
| s | 2189 | 6.0% |
| l | 2052 | 5.7% |
| n | 1976 | 5.5% |
| m | 1839 | 5.1% |
| i | 1708 | 4.7% |
| a | 1179 | 3.3% |
| Other values (11) | 7514 | 20.7% |
Common
| Value | Count | Frequency (%) |
| 7701 | 93.6% |
| , | 487 | 5.9% |
| ( | 13 | 0.2% |
| ! | 13 | 0.2% |
| ) | 13 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 44461 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 7701 | 17.3% |
| o | 5548 | 12.5% |
| e | 5136 | 11.6% |
| t | 4316 | 9.7% |
| r | 2777 | 6.2% |
| s | 2189 | 4.9% |
| l | 2052 | 4.6% |
| n | 1976 | 4.4% |
| m | 1839 | 4.1% |
| i | 1708 | 3.8% |
| Other values (16) | 9219 | 20.7% |
| Distinct | 3 |
|---|
| Distinct (%) | 0.3% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 8.3 KiB |
|---|
Length
| Max length | 64 |
|---|
| Median length | 55 |
|---|
| Mean length | 46.30418251 |
|---|
| Min length | 21 |
|---|
Characters and Unicode
| Total characters | 48712 |
|---|
| Distinct characters | 25 |
|---|
| Distinct categories | 3 ? |
|---|
| Distinct scripts | 2 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Sample
| 1st row | Encourage them to try and talk them through the idea as they try |
|---|
| 2nd row | Encourage them to try and talk them through the idea as they try |
|---|
| 3rd row | Explain to them in different ways until they understand |
|---|
| 4th row | Encourage them to try and talk them through the idea as they try |
|---|
| 5th row | Explain to them in different ways until they understand |
|---|
| Value | Count | Frequency (%) |
| them | 1218 | 13.7% |
| they | 739 | 8.3% |
| to | 739 | 8.3% |
| explain | 573 | 6.5% |
| in | 573 | 6.5% |
| different | 573 | 6.5% |
| ways | 573 | 6.5% |
| until | 573 | 6.5% |
| understand | 573 | 6.5% |
| try | 332 | 3.7% |
| Other values (11) | 2414 | 27.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| 7828 | 16.1% |
| t | 5558 | 11.4% |
| e | 4487 | 9.2% |
| n | 4083 | 8.4% |
| a | 3175 | 6.5% |
| h | 3081 | 6.3% |
| i | 2458 | 5.0% |
| d | 2051 | 4.2% |
| r | 1810 | 3.7% |
| y | 1644 | 3.4% |
| Other values (15) | 12537 | 25.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 39519 | 81.1% |
| Space Separator | 7828 | 16.1% |
| Uppercase Letter | 1365 | 2.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 5558 | 14.1% |
| e | 4487 | 11.4% |
| n | 4083 | 10.3% |
| a | 3175 | 8.0% |
| h | 3081 | 7.8% |
| i | 2458 | 6.2% |
| d | 2051 | 5.2% |
| r | 1810 | 4.6% |
| y | 1644 | 4.2% |
| m | 1531 | 3.9% |
| Other values (11) | 9641 | 24.4% |
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 739 | 54.1% |
| S | 313 | 22.9% |
| I | 313 | 22.9% |
Space Separator
| Value | Count | Frequency (%) |
| 7828 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 40884 | 83.9% |
| Common | 7828 | 16.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 5558 | 13.6% |
| e | 4487 | 11.0% |
| n | 4083 | 10.0% |
| a | 3175 | 7.8% |
| h | 3081 | 7.5% |
| i | 2458 | 6.0% |
| d | 2051 | 5.0% |
| r | 1810 | 4.4% |
| y | 1644 | 4.0% |
| m | 1531 | 3.7% |
| Other values (14) | 11006 | 26.9% |
Common
| Value | Count | Frequency (%) |
| 7828 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 48712 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 7828 | 16.1% |
| t | 5558 | 11.4% |
| e | 4487 | 9.2% |
| n | 4083 | 8.4% |
| a | 3175 | 6.5% |
| h | 3081 | 6.3% |
| i | 2458 | 5.0% |
| d | 2051 | 4.2% |
| r | 1810 | 3.7% |
| y | 1644 | 3.4% |
| Other values (15) | 12537 | 25.7% |
| Distinct | 3 |
|---|
| Distinct (%) | 0.3% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 8.3 KiB |
|---|
Length
| Max length | 66 |
|---|
| Median length | 63 |
|---|
| Mean length | 59.94296578 |
|---|
| Min length | 48 |
|---|
Characters and Unicode
| Total characters | 63060 |
|---|
| Distinct characters | 26 |
|---|
| Distinct categories | 4 ? |
|---|
| Distinct scripts | 2 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Sample
| 1st row | Listening to music or listening to the radio or talking to friends |
|---|
| 2nd row | Photography or watching films or people watching |
|---|
| 3rd row | Physical/sports activities or fine wines, fine foods or dancing |
|---|
| 4th row | Listening to music or listening to the radio or talking to friends |
|---|
| 5th row | Listening to music or listening to the radio or talking to friends |
|---|
| Value | Count | Frequency (%) |
| or | 2104 | 20.5% |
| to | 1419 | 13.8% |
| listening | 946 | 9.2% |
| watching | 618 | 6.0% |
| fine | 540 | 5.3% |
| the | 473 | 4.6% |
| radio | 473 | 4.6% |
| talking | 473 | 4.6% |
| friends | 473 | 4.6% |
| music | 473 | 4.6% |
| Other values (8) | 2277 | 22.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| 9217 | 14.6% |
| i | 6871 | 10.9% |
| o | 5733 | 9.1% |
| t | 5048 | 8.0% |
| n | 4806 | 7.6% |
| s | 3821 | 6.1% |
| r | 3629 | 5.8% |
| e | 3590 | 5.7% |
| a | 2683 | 4.3% |
| g | 2616 | 4.1% |
| Other values (16) | 15046 | 23.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 52251 | 82.9% |
| Space Separator | 9217 | 14.6% |
| Uppercase Letter | 1052 | 1.7% |
| Other Punctuation | 540 | 0.9% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 6871 | 13.1% |
| o | 5733 | 11.0% |
| t | 5048 | 9.7% |
| n | 4806 | 9.2% |
| s | 3821 | 7.3% |
| r | 3629 | 6.9% |
| e | 3590 | 6.9% |
| a | 2683 | 5.1% |
| g | 2616 | 5.0% |
| h | 1979 | 3.8% |
| Other values (11) | 11475 | 22.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 579 | 55.0% |
| L | 473 | 45.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 270 | 50.0% |
| , | 270 | 50.0% |
Space Separator
| Value | Count | Frequency (%) |
| 9217 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 53303 | 84.5% |
| Common | 9757 | 15.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 6871 | 12.9% |
| o | 5733 | 10.8% |
| t | 5048 | 9.5% |
| n | 4806 | 9.0% |
| s | 3821 | 7.2% |
| r | 3629 | 6.8% |
| e | 3590 | 6.7% |
| a | 2683 | 5.0% |
| g | 2616 | 4.9% |
| h | 1979 | 3.7% |
| Other values (13) | 12527 | 23.5% |
Common
| Value | Count | Frequency (%) |
| 9217 | 94.5% |
| / | 270 | 2.8% |
| , | 270 | 2.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 63060 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 9217 | 14.6% |
| i | 6871 | 10.9% |
| o | 5733 | 9.1% |
| t | 5048 | 8.0% |
| n | 4806 | 7.6% |
| s | 3821 | 6.1% |
| r | 3629 | 5.8% |
| e | 3590 | 5.7% |
| a | 2683 | 4.3% |
| g | 2616 | 4.1% |
| Other values (16) | 15046 | 23.9% |
| Distinct | 3 |
|---|
| Distinct (%) | 0.3% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 8.3 KiB |
|---|
Length
| Max length | 40 |
|---|
| Median length | 19 |
|---|
| Mean length | 25.85741445 |
|---|
| Min length | 18 |
|---|
Characters and Unicode
| Total characters | 27202 |
|---|
| Distinct characters | 23 |
|---|
| Distinct categories | 3 ? |
|---|
| Distinct scripts | 2 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Sample
| 1st row | Talking to friends |
|---|
| 2nd row | Watching television |
|---|
| 3rd row | Doing physical activity or making things |
|---|
| 4th row | Watching television |
|---|
| 5th row | Doing physical activity or making things |
|---|
| Value | Count | Frequency (%) |
| watching | 500 | 13.5% |
| television | 500 | 13.5% |
| doing | 353 | 9.5% |
| physical | 353 | 9.5% |
| activity | 353 | 9.5% |
| or | 353 | 9.5% |
| making | 353 | 9.5% |
| things | 353 | 9.5% |
| talking | 199 | 5.4% |
| to | 199 | 5.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 4016 | 14.8% |
| 2663 | 9.8% |
| n | 2457 | 9.0% |
| t | 2258 | 8.3% |
| g | 1758 | 6.5% |
| a | 1758 | 6.5% |
| s | 1405 | 5.2% |
| o | 1405 | 5.2% |
| c | 1206 | 4.4% |
| h | 1206 | 4.4% |
| Other values (13) | 7070 | 26.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 23487 | 86.3% |
| Space Separator | 2663 | 9.8% |
| Uppercase Letter | 1052 | 3.9% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 4016 | 17.1% |
| n | 2457 | 10.5% |
| t | 2258 | 9.6% |
| g | 1758 | 7.5% |
| a | 1758 | 7.5% |
| s | 1405 | 6.0% |
| o | 1405 | 6.0% |
| c | 1206 | 5.1% |
| h | 1206 | 5.1% |
| e | 1199 | 5.1% |
| Other values (9) | 4819 | 20.5% |
Uppercase Letter
| Value | Count | Frequency (%) |
| W | 500 | 47.5% |
| D | 353 | 33.6% |
| T | 199 | 18.9% |
Space Separator
| Value | Count | Frequency (%) |
| 2663 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 24539 | 90.2% |
| Common | 2663 | 9.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 4016 | 16.4% |
| n | 2457 | 10.0% |
| t | 2258 | 9.2% |
| g | 1758 | 7.2% |
| a | 1758 | 7.2% |
| s | 1405 | 5.7% |
| o | 1405 | 5.7% |
| c | 1206 | 4.9% |
| h | 1206 | 4.9% |
| e | 1199 | 4.9% |
| Other values (12) | 5871 | 23.9% |
Common
| Value | Count | Frequency (%) |
| 2663 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 27202 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 4016 | 14.8% |
| 2663 | 9.8% |
| n | 2457 | 9.0% |
| t | 2258 | 8.3% |
| g | 1758 | 6.5% |
| a | 1758 | 6.5% |
| s | 1405 | 5.2% |
| o | 1405 | 5.2% |
| c | 1206 | 4.4% |
| h | 1206 | 4.4% |
| Other values (13) | 7070 | 26.0% |
| Distinct | 3 |
|---|
| Distinct (%) | 0.3% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 8.3 KiB |
|---|
Length
| Max length | 42 |
|---|
| Median length | 32 |
|---|
| Mean length | 35.15209125 |
|---|
| Min length | 31 |
|---|
Characters and Unicode
| Total characters | 36980 |
|---|
| Distinct characters | 20 |
|---|
| Distinct categories | 3 ? |
|---|
| Distinct scripts | 2 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Sample
| 1st row | I arrange a face to face meeting |
|---|
| 2nd row | I talk to them on the telephone |
|---|
| 3rd row | I arrange a face to face meeting |
|---|
| 4th row | I arrange a face to face meeting |
|---|
| 5th row | I talk to them on the telephone |
|---|
| Value | Count | Frequency (%) |
| to | 1428 | 17.6% |
| i | 1052 | 13.0% |
| face | 464 | 5.7% |
| them | 444 | 5.5% |
| on | 444 | 5.5% |
| the | 444 | 5.5% |
| telephone | 444 | 5.5% |
| talk | 444 | 5.5% |
| an | 376 | 4.6% |
| activity | 376 | 4.6% |
| Other values (7) | 2200 | 27.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 7064 | 19.1% |
| t | 5692 | 15.4% |
| e | 4884 | 13.2% |
| a | 2732 | 7.4% |
| o | 2692 | 7.3% |
| h | 2084 | 5.6% |
| n | 1728 | 4.7% |
| r | 1592 | 4.3% |
| g | 1216 | 3.3% |
| I | 1052 | 2.8% |
| Other values (10) | 6244 | 16.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 28864 | 78.1% |
| Space Separator | 7064 | 19.1% |
| Uppercase Letter | 1052 | 2.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 5692 | 19.7% |
| e | 4884 | 16.9% |
| a | 2732 | 9.5% |
| o | 2692 | 9.3% |
| h | 2084 | 7.2% |
| n | 1728 | 6.0% |
| r | 1592 | 5.5% |
| g | 1216 | 4.2% |
| i | 984 | 3.4% |
| l | 888 | 3.1% |
| Other values (8) | 4372 | 15.1% |
Space Separator
| Value | Count | Frequency (%) |
| 7064 | 100.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 1052 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 29916 | 80.9% |
| Common | 7064 | 19.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 5692 | 19.0% |
| e | 4884 | 16.3% |
| a | 2732 | 9.1% |
| o | 2692 | 9.0% |
| h | 2084 | 7.0% |
| n | 1728 | 5.8% |
| r | 1592 | 5.3% |
| g | 1216 | 4.1% |
| I | 1052 | 3.5% |
| i | 984 | 3.3% |
| Other values (9) | 5260 | 17.6% |
Common
| Value | Count | Frequency (%) |
| 7064 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 36980 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 7064 | 19.1% |
| t | 5692 | 15.4% |
| e | 4884 | 13.2% |
| a | 2732 | 7.4% |
| o | 2692 | 7.3% |
| h | 2084 | 5.6% |
| n | 1728 | 4.7% |
| r | 1592 | 4.3% |
| g | 1216 | 3.3% |
| I | 1052 | 2.8% |
| Other values (10) | 6244 | 16.9% |
| Distinct | 3 |
|---|
| Distinct (%) | 0.3% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 8.3 KiB |
|---|
Length
| Max length | 15 |
|---|
| Median length | 14 |
|---|
| Mean length | 14.43060837 |
|---|
| Min length | 14 |
|---|
Characters and Unicode
| Total characters | 15181 |
|---|
| Distinct characters | 16 |
|---|
| Distinct categories | 3 ? |
|---|
| Distinct scripts | 2 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Sample
| 1st row | Look and dress |
|---|
| 2nd row | Stand and move |
|---|
| 3rd row | Sound and speak |
|---|
| 4th row | Look and dress |
|---|
| 5th row | Sound and speak |
|---|
| Value | Count | Frequency (%) |
| and | 1052 | 33.3% |
| look | 504 | 16.0% |
| dress | 504 | 16.0% |
| sound | 453 | 14.4% |
| speak | 453 | 14.4% |
| stand | 95 | 3.0% |
| move | 95 | 3.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 2104 | 13.9% |
| d | 2104 | 13.9% |
| a | 1600 | 10.5% |
| n | 1600 | 10.5% |
| o | 1556 | 10.2% |
| s | 1461 | 9.6% |
| e | 1052 | 6.9% |
| k | 957 | 6.3% |
| S | 548 | 3.6% |
| L | 504 | 3.3% |
| Other values (6) | 1695 | 11.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 12025 | 79.2% |
| Space Separator | 2104 | 13.9% |
| Uppercase Letter | 1052 | 6.9% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| d | 2104 | 17.5% |
| a | 1600 | 13.3% |
| n | 1600 | 13.3% |
| o | 1556 | 12.9% |
| s | 1461 | 12.1% |
| e | 1052 | 8.7% |
| k | 957 | 8.0% |
| r | 504 | 4.2% |
| u | 453 | 3.8% |
| p | 453 | 3.8% |
| Other values (3) | 285 | 2.4% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 548 | 52.1% |
| L | 504 | 47.9% |
Space Separator
| Value | Count | Frequency (%) |
| 2104 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 13077 | 86.1% |
| Common | 2104 | 13.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| d | 2104 | 16.1% |
| a | 1600 | 12.2% |
| n | 1600 | 12.2% |
| o | 1556 | 11.9% |
| s | 1461 | 11.2% |
| e | 1052 | 8.0% |
| k | 957 | 7.3% |
| S | 548 | 4.2% |
| L | 504 | 3.9% |
| r | 504 | 3.9% |
| Other values (5) | 1191 | 9.1% |
Common
| Value | Count | Frequency (%) |
| 2104 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 15181 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2104 | 13.9% |
| d | 2104 | 13.9% |
| a | 1600 | 10.5% |
| n | 1600 | 10.5% |
| o | 1556 | 10.2% |
| s | 1461 | 9.6% |
| e | 1052 | 6.9% |
| k | 957 | 6.3% |
| S | 548 | 3.6% |
| L | 504 | 3.3% |
| Other values (6) | 1695 | 11.2% |
| Distinct | 3 |
|---|
| Distinct (%) | 0.3% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 8.3 KiB |
|---|
Length
| Max length | 56 |
|---|
| Median length | 56 |
|---|
| Mean length | 51.92585551 |
|---|
| Min length | 39 |
|---|
Characters and Unicode
| Total characters | 54626 |
|---|
| Distinct characters | 23 |
|---|
| Distinct categories | 4 ? |
|---|
| Distinct scripts | 2 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Sample
| 1st row | I keep replaying in my mind what it is that has upset me |
|---|
| 2nd row | I stomp about, slam doors and throw things |
|---|
| 3rd row | I keep replaying in my mind what it is that has upset me |
|---|
| 4th row | I keep replaying in my mind what it is that has upset me |
|---|
| 5th row | I keep replaying in my mind what it is that has upset me |
|---|
| Value | Count | Frequency (%) |
| i | 1202 | 9.6% |
| it | 778 | 6.2% |
| keep | 778 | 6.2% |
| me | 778 | 6.2% |
| upset | 778 | 6.2% |
| has | 778 | 6.2% |
| that | 778 | 6.2% |
| is | 778 | 6.2% |
| what | 778 | 6.2% |
| mind | 778 | 6.2% |
| Other values (16) | 4252 | 34.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 11404 | 20.9% |
| t | 4836 | 8.9% |
| e | 4640 | 8.5% |
| i | 4014 | 7.3% |
| a | 3634 | 6.7% |
| s | 3130 | 5.7% |
| h | 2882 | 5.3% |
| p | 2758 | 5.0% |
| n | 2732 | 5.0% |
| m | 2582 | 4.7% |
| Other values (13) | 12014 | 22.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 41896 | 76.7% |
| Space Separator | 11404 | 20.9% |
| Uppercase Letter | 1202 | 2.2% |
| Other Punctuation | 124 | 0.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 4836 | 11.5% |
| e | 4640 | 11.1% |
| i | 4014 | 9.6% |
| a | 3634 | 8.7% |
| s | 3130 | 7.5% |
| h | 2882 | 6.9% |
| p | 2758 | 6.6% |
| n | 2732 | 6.5% |
| m | 2582 | 6.2% |
| l | 1652 | 3.9% |
| Other values (10) | 9036 | 21.6% |
Space Separator
| Value | Count | Frequency (%) |
| 11404 | 100.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 1202 | 100.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 124 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 43098 | 78.9% |
| Common | 11528 | 21.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 4836 | 11.2% |
| e | 4640 | 10.8% |
| i | 4014 | 9.3% |
| a | 3634 | 8.4% |
| s | 3130 | 7.3% |
| h | 2882 | 6.7% |
| p | 2758 | 6.4% |
| n | 2732 | 6.3% |
| m | 2582 | 6.0% |
| l | 1652 | 3.8% |
| Other values (11) | 10238 | 23.8% |
Common
| Value | Count | Frequency (%) |
| 11404 | 98.9% |
| , | 124 | 1.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 54626 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 11404 | 20.9% |
| t | 4836 | 8.9% |
| e | 4640 | 8.5% |
| i | 4014 | 7.3% |
| a | 3634 | 6.7% |
| s | 3130 | 5.7% |
| h | 2882 | 5.3% |
| p | 2758 | 5.0% |
| n | 2732 | 5.0% |
| m | 2582 | 4.7% |
| Other values (13) | 12014 | 22.0% |
| Distinct | 3 |
|---|
| Distinct (%) | 0.3% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 8.3 KiB |
|---|
Length
| Max length | 18 |
|---|
| Median length | 5 |
|---|
| Mean length | 9.967680608 |
|---|
| Min length | 5 |
|---|
Characters and Unicode
| Total characters | 10486 |
|---|
| Distinct characters | 17 |
|---|
| Distinct categories | 3 ? |
|---|
| Distinct scripts | 2 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Sample
| 1st row | Things I have done |
|---|
| 2nd row | Faces |
|---|
| 3rd row | Things I have done |
|---|
| 4th row | Things I have done |
|---|
| 5th row | Things I have done |
|---|
| Value | Count | Frequency (%) |
| faces | 563 | 24.9% |
| things | 402 | 17.8% |
| i | 402 | 17.8% |
| have | 402 | 17.8% |
| done | 402 | 17.8% |
| names | 87 | 3.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1454 | 13.9% |
| 1206 | 11.5% |
| s | 1052 | 10.0% |
| a | 1052 | 10.0% |
| n | 804 | 7.7% |
| h | 804 | 7.7% |
| F | 563 | 5.4% |
| c | 563 | 5.4% |
| i | 402 | 3.8% |
| T | 402 | 3.8% |
| Other values (7) | 2184 | 20.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7826 | 74.6% |
| Uppercase Letter | 1454 | 13.9% |
| Space Separator | 1206 | 11.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1454 | 18.6% |
| s | 1052 | 13.4% |
| a | 1052 | 13.4% |
| n | 804 | 10.3% |
| h | 804 | 10.3% |
| c | 563 | 7.2% |
| i | 402 | 5.1% |
| g | 402 | 5.1% |
| v | 402 | 5.1% |
| d | 402 | 5.1% |
| Other values (2) | 489 | 6.2% |
Uppercase Letter
| Value | Count | Frequency (%) |
| F | 563 | 38.7% |
| T | 402 | 27.6% |
| I | 402 | 27.6% |
| N | 87 | 6.0% |
Space Separator
| Value | Count | Frequency (%) |
| 1206 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 9280 | 88.5% |
| Common | 1206 | 11.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1454 | 15.7% |
| s | 1052 | 11.3% |
| a | 1052 | 11.3% |
| n | 804 | 8.7% |
| h | 804 | 8.7% |
| F | 563 | 6.1% |
| c | 563 | 6.1% |
| i | 402 | 4.3% |
| T | 402 | 4.3% |
| g | 402 | 4.3% |
| Other values (6) | 1782 | 19.2% |
Common
| Value | Count | Frequency (%) |
| 1206 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 10486 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 1454 | 13.9% |
| 1206 | 11.5% |
| s | 1052 | 10.0% |
| a | 1052 | 10.0% |
| n | 804 | 7.7% |
| h | 804 | 7.7% |
| F | 563 | 5.4% |
| c | 563 | 5.4% |
| i | 402 | 3.8% |
| T | 402 | 3.8% |
| Other values (7) | 2184 | 20.8% |
| Distinct | 3 |
|---|
| Distinct (%) | 0.3% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 8.3 KiB |
|---|
Length
| Max length | 25 |
|---|
| Median length | 25 |
|---|
| Mean length | 24.29847909 |
|---|
| Min length | 19 |
|---|
Characters and Unicode
| Total characters | 25562 |
|---|
| Distinct characters | 23 |
|---|
| Distinct categories | 3 ? |
|---|
| Distinct scripts | 2 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Sample
| 1st row | Their voice changes |
|---|
| 2nd row | They avoid looking at you |
|---|
| 3rd row | The vibes I get from them |
|---|
| 4th row | The vibes I get from them |
|---|
| 5th row | The vibes I get from them |
|---|
| Value | Count | Frequency (%) |
| the | 674 | 11.8% |
| vibes | 674 | 11.8% |
| i | 674 | 11.8% |
| get | 674 | 11.8% |
| from | 674 | 11.8% |
| them | 674 | 11.8% |
| they | 255 | 4.5% |
| avoid | 255 | 4.5% |
| looking | 255 | 4.5% |
| at | 255 | 4.5% |
| Other values (4) | 624 | 11.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 4636 | 18.1% |
| e | 3320 | 13.0% |
| h | 1849 | 7.2% |
| o | 1817 | 7.1% |
| t | 1603 | 6.3% |
| i | 1430 | 5.6% |
| m | 1348 | 5.3% |
| T | 1052 | 4.1% |
| v | 1052 | 4.1% |
| g | 1052 | 4.1% |
| Other values (13) | 6403 | 25.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 19200 | 75.1% |
| Space Separator | 4636 | 18.1% |
| Uppercase Letter | 1726 | 6.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 3320 | 17.3% |
| h | 1849 | 9.6% |
| o | 1817 | 9.5% |
| t | 1603 | 8.3% |
| i | 1430 | 7.4% |
| m | 1348 | 7.0% |
| v | 1052 | 5.5% |
| g | 1052 | 5.5% |
| r | 797 | 4.2% |
| s | 797 | 4.2% |
| Other values (10) | 4135 | 21.5% |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 1052 | 61.0% |
| I | 674 | 39.0% |
Space Separator
| Value | Count | Frequency (%) |
| 4636 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 20926 | 81.9% |
| Common | 4636 | 18.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 3320 | 15.9% |
| h | 1849 | 8.8% |
| o | 1817 | 8.7% |
| t | 1603 | 7.7% |
| i | 1430 | 6.8% |
| m | 1348 | 6.4% |
| T | 1052 | 5.0% |
| v | 1052 | 5.0% |
| g | 1052 | 5.0% |
| r | 797 | 3.8% |
| Other values (12) | 5606 | 26.8% |
Common
| Value | Count | Frequency (%) |
| 4636 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 25562 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 4636 | 18.1% |
| e | 3320 | 13.0% |
| h | 1849 | 7.2% |
| o | 1817 | 7.1% |
| t | 1603 | 6.3% |
| i | 1430 | 5.6% |
| m | 1348 | 5.3% |
| T | 1052 | 4.1% |
| v | 1052 | 4.1% |
| g | 1052 | 4.1% |
| Other values (13) | 6403 | 25.0% |
| Distinct | 3 |
|---|
| Distinct (%) | 0.3% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 8.3 KiB |
|---|
Length
| Max length | 38 |
|---|
| Median length | 32 |
|---|
| Mean length | 31.36882129 |
|---|
| Min length | 30 |
|---|
Characters and Unicode
| Total characters | 33000 |
|---|
| Distinct characters | 22 |
|---|
| Distinct categories | 4 ? |
|---|
| Distinct scripts | 2 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Sample
| 1st row | I say "it's great to hear your voice!" |
|---|
| 2nd row | I say "it's great to hear your voice!" |
|---|
| 3rd row | I say "it's great to see you!" |
|---|
| 4th row | I give them a hug or a handshake |
|---|
| 5th row | I say "it's great to see you!" |
|---|
| Value | Count | Frequency (%) |
| i | 1052 | 13.3% |
| a | 968 | 12.2% |
| say | 568 | 7.2% |
| it's | 568 | 7.2% |
| great | 568 | 7.2% |
| to | 568 | 7.2% |
| see | 509 | 6.4% |
| you | 509 | 6.4% |
| give | 484 | 6.1% |
| them | 484 | 6.1% |
| Other values (6) | 1629 | 20.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| 6855 | 20.8% |
| e | 3156 | 9.6% |
| a | 3131 | 9.5% |
| t | 2188 | 6.6% |
| s | 2129 | 6.5% |
| h | 1995 | 6.0% |
| o | 1679 | 5.1% |
| g | 1536 | 4.7% |
| r | 1170 | 3.5% |
| y | 1136 | 3.4% |
| Other values (12) | 8025 | 24.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 22821 | 69.2% |
| Space Separator | 6855 | 20.8% |
| Other Punctuation | 2272 | 6.9% |
| Uppercase Letter | 1052 | 3.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 3156 | 13.8% |
| a | 3131 | 13.7% |
| t | 2188 | 9.6% |
| s | 2129 | 9.3% |
| h | 1995 | 8.7% |
| o | 1679 | 7.4% |
| g | 1536 | 6.7% |
| r | 1170 | 5.1% |
| y | 1136 | 5.0% |
| i | 1111 | 4.9% |
| Other values (7) | 3590 | 15.7% |
Other Punctuation
| Value | Count | Frequency (%) |
| " | 1136 | 50.0% |
| ! | 568 | 25.0% |
| ' | 568 | 25.0% |
Space Separator
| Value | Count | Frequency (%) |
| 6855 | 100.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 1052 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 23873 | 72.3% |
| Common | 9127 | 27.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 3156 | 13.2% |
| a | 3131 | 13.1% |
| t | 2188 | 9.2% |
| s | 2129 | 8.9% |
| h | 1995 | 8.4% |
| o | 1679 | 7.0% |
| g | 1536 | 6.4% |
| r | 1170 | 4.9% |
| y | 1136 | 4.8% |
| i | 1111 | 4.7% |
| Other values (8) | 4642 | 19.4% |
Common
| Value | Count | Frequency (%) |
| 6855 | 75.1% |
| " | 1136 | 12.4% |
| ! | 568 | 6.2% |
| ' | 568 | 6.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 33000 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 6855 | 20.8% |
| e | 3156 | 9.6% |
| a | 3131 | 9.5% |
| t | 2188 | 6.6% |
| s | 2129 | 6.5% |
| h | 1995 | 6.0% |
| o | 1679 | 5.1% |
| g | 1536 | 4.7% |
| r | 1170 | 3.5% |
| y | 1136 | 3.4% |
| Other values (12) | 8025 | 24.3% |
| Distinct | 190 |
|---|
| Distinct (%) | 65.1% |
|---|
| Missing | 760 |
|---|
| Missing (%) | 72.2% |
|---|
| Memory size | 8.3 KiB |
|---|
Length
| Max length | 818 |
|---|
| Median length | 234 |
|---|
| Mean length | 43.7260274 |
|---|
| Min length | 1 |
|---|
Characters and Unicode
| Total characters | 12768 |
|---|
| Distinct characters | 74 |
|---|
| Distinct categories | 12 ? |
|---|
| Distinct scripts | 2 ? |
|---|
| Distinct blocks | 4 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Unique | 158 ? |
|---|
| Unique (%) | 54.1% |
|---|
Sample
| 1st row | no |
|---|
| 2nd row | - |
|---|
| 3rd row | Improve their learning style |
|---|
| 4th row | no have. perfect |
|---|
| 5th row | it’s nice |
|---|
| Value | Count | Frequency (%) |
| to | 90 | 3.9% |
| i | 85 | 3.7% |
| the | 64 | 2.8% |
| 52 | 2.2% |
| no | 47 | 2.0% |
| learning | 47 | 2.0% |
| online | 41 | 1.8% |
| and | 40 | 1.7% |
| good | 36 | 1.5% |
| is | 35 | 1.5% |
| Other values (650) | 1788 | 76.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| 2076 | 16.3% |
| e | 1283 | 10.0% |
| t | 934 | 7.3% |
| o | 880 | 6.9% |
| n | 858 | 6.7% |
| s | 725 | 5.7% |
| a | 696 | 5.5% |
| i | 674 | 5.3% |
| r | 544 | 4.3% |
| l | 432 | 3.4% |
| Other values (64) | 3666 | 28.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 10013 | 78.4% |
| Space Separator | 2076 | 16.3% |
| Uppercase Letter | 345 | 2.7% |
| Other Punctuation | 238 | 1.9% |
| Dash Punctuation | 39 | 0.3% |
| Decimal Number | 20 | 0.2% |
| Other Symbol | 12 | 0.1% |
| Currency Symbol | 12 | 0.1% |
| Control | 7 | 0.1% |
| Close Punctuation | 3 | < 0.1% |
| Other values (2) | 3 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1283 | 12.8% |
| t | 934 | 9.3% |
| o | 880 | 8.8% |
| n | 858 | 8.6% |
| s | 725 | 7.2% |
| a | 696 | 7.0% |
| i | 674 | 6.7% |
| r | 544 | 5.4% |
| l | 432 | 4.3% |
| h | 411 | 4.1% |
| Other values (17) | 2576 | 25.7% |
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 94 | 27.2% |
| N | 65 | 18.8% |
| T | 34 | 9.9% |
| O | 19 | 5.5% |
| S | 19 | 5.5% |
| G | 18 | 5.2% |
| M | 12 | 3.5% |
| F | 11 | 3.2% |
| L | 10 | 2.9% |
| A | 9 | 2.6% |
| Other values (13) | 54 | 15.7% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 126 | 52.9% |
| , | 59 | 24.8% |
| ' | 23 | 9.7% |
| ? | 14 | 5.9% |
| ! | 7 | 2.9% |
| / | 4 | 1.7% |
| ; | 2 | 0.8% |
| : | 2 | 0.8% |
| % | 1 | 0.4% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 7 | 35.0% |
| 1 | 4 | 20.0% |
| 4 | 2 | 10.0% |
| 6 | 2 | 10.0% |
| 9 | 2 | 10.0% |
| 3 | 2 | 10.0% |
| 5 | 1 | 5.0% |
Space Separator
| Value | Count | Frequency (%) |
| 2076 | 100.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 39 | 100.0% |
Other Symbol
| Value | Count | Frequency (%) |
| â„¢ | 12 | 100.0% |
Currency Symbol
| Value | Count | Frequency (%) |
| € | 12 | 100.0% |
Control
| Value | Count | Frequency (%) |
| 7 | 100.0% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 3 | 100.0% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 2 | 100.0% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 1 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 10358 | 81.1% |
| Common | 2410 | 18.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1283 | 12.4% |
| t | 934 | 9.0% |
| o | 880 | 8.5% |
| n | 858 | 8.3% |
| s | 725 | 7.0% |
| a | 696 | 6.7% |
| i | 674 | 6.5% |
| r | 544 | 5.3% |
| l | 432 | 4.2% |
| h | 411 | 4.0% |
| Other values (40) | 2921 | 28.2% |
Common
| Value | Count | Frequency (%) |
| 2076 | 86.1% |
| . | 126 | 5.2% |
| , | 59 | 2.4% |
| - | 39 | 1.6% |
| ' | 23 | 1.0% |
| ? | 14 | 0.6% |
| â„¢ | 12 | 0.5% |
| € | 12 | 0.5% |
| ! | 7 | 0.3% |
| 0 | 7 | 0.3% |
| Other values (14) | 35 | 1.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 12732 | 99.7% |
| Letterlike Symbols | 12 | 0.1% |
| None | 12 | 0.1% |
| Currency Symbols | 12 | 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2076 | 16.3% |
| e | 1283 | 10.1% |
| t | 934 | 7.3% |
| o | 880 | 6.9% |
| n | 858 | 6.7% |
| s | 725 | 5.7% |
| a | 696 | 5.5% |
| i | 674 | 5.3% |
| r | 544 | 4.3% |
| l | 432 | 3.4% |
| Other values (61) | 3630 | 28.5% |
Letterlike Symbols
| Value | Count | Frequency (%) |
| â„¢ | 12 | 100.0% |
None
| Value | Count | Frequency (%) |
| â | 12 | 100.0% |
Currency Symbols
| Value | Count | Frequency (%) |
| € | 12 | 100.0% |
| Distinct | 492 |
|---|
| Distinct (%) | 48.1% |
|---|
| Missing | 29 |
|---|
| Missing (%) | 2.8% |
|---|
| Memory size | 8.3 KiB |
|---|
Length
| Max length | 74 |
|---|
| Median length | 44 |
|---|
| Mean length | 22.8172043 |
|---|
| Min length | 2 |
|---|
Characters and Unicode
| Total characters | 23342 |
|---|
| Distinct characters | 55 |
|---|
| Distinct categories | 7 ? |
|---|
| Distinct scripts | 2 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Unique | 322 ? |
|---|
| Unique (%) | 31.5% |
|---|
Sample
| 1st row | Fsktm |
|---|
| 2nd row | FSKTM |
|---|
| 3rd row | Education |
|---|
| 4th row | Law |
|---|
| 5th row | FCSIT |
|---|
| Value | Count | Frequency (%) |
| and | 338 | 10.7% |
| faculty | 271 | 8.6% |
| of | 262 | 8.3% |
| science | 166 | 5.3% |
| computer | 105 | 3.3% |
| business | 104 | 3.3% |
| information | 86 | 2.7% |
| technology | 86 | 2.7% |
| accountancy | 85 | 2.7% |
| engineering | 73 | 2.3% |
| Other values (217) | 1569 | 49.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| 2337 | 10.0% |
| n | 1939 | 8.3% |
| a | 1772 | 7.6% |
| i | 1566 | 6.7% |
| c | 1522 | 6.5% |
| e | 1479 | 6.3% |
| o | 1186 | 5.1% |
| t | 1130 | 4.8% |
| s | 899 | 3.9% |
| u | 892 | 3.8% |
| Other values (45) | 8620 | 36.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 16894 | 72.4% |
| Uppercase Letter | 4037 | 17.3% |
| Space Separator | 2337 | 10.0% |
| Other Punctuation | 54 | 0.2% |
| Open Punctuation | 8 | < 0.1% |
| Close Punctuation | 8 | < 0.1% |
| Decimal Number | 4 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 1939 | 11.5% |
| a | 1772 | 10.5% |
| i | 1566 | 9.3% |
| c | 1522 | 9.0% |
| e | 1479 | 8.8% |
| o | 1186 | 7.0% |
| t | 1130 | 6.7% |
| s | 899 | 5.3% |
| u | 892 | 5.3% |
| l | 721 | 4.3% |
| Other values (14) | 3788 | 22.4% |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 523 | 13.0% |
| S | 512 | 12.7% |
| F | 391 | 9.7% |
| I | 351 | 8.7% |
| C | 334 | 8.3% |
| E | 314 | 7.8% |
| T | 280 | 6.9% |
| M | 211 | 5.2% |
| N | 177 | 4.4% |
| B | 136 | 3.4% |
| Other values (13) | 808 | 20.0% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 2 | 50.0% |
| 2 | 1 | 25.0% |
| 4 | 1 | 25.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| & | 43 | 79.6% |
| , | 11 | 20.4% |
Space Separator
| Value | Count | Frequency (%) |
| 2337 | 100.0% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 8 | 100.0% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 8 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 20931 | 89.7% |
| Common | 2411 | 10.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 1939 | 9.3% |
| a | 1772 | 8.5% |
| i | 1566 | 7.5% |
| c | 1522 | 7.3% |
| e | 1479 | 7.1% |
| o | 1186 | 5.7% |
| t | 1130 | 5.4% |
| s | 899 | 4.3% |
| u | 892 | 4.3% |
| l | 721 | 3.4% |
| Other values (37) | 7825 | 37.4% |
Common
| Value | Count | Frequency (%) |
| 2337 | 96.9% |
| & | 43 | 1.8% |
| , | 11 | 0.5% |
| ( | 8 | 0.3% |
| ) | 8 | 0.3% |
| 1 | 2 | 0.1% |
| 2 | 1 | < 0.1% |
| 4 | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 23342 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2337 | 10.0% |
| n | 1939 | 8.3% |
| a | 1772 | 7.6% |
| i | 1566 | 6.7% |
| c | 1522 | 6.5% |
| e | 1479 | 6.3% |
| o | 1186 | 5.1% |
| t | 1130 | 4.8% |
| s | 899 | 3.9% |
| u | 892 | 3.8% |
| Other values (45) | 8620 | 36.9% |
| Distinct | 562 |
|---|
| Distinct (%) | 54.9% |
|---|
| Missing | 29 |
|---|
| Missing (%) | 2.8% |
|---|
| Memory size | 8.3 KiB |
|---|
Length
| Max length | 60 |
|---|
| Median length | 43 |
|---|
| Mean length | 16.35288368 |
|---|
| Min length | 1 |
|---|
Characters and Unicode
| Total characters | 16729 |
|---|
| Distinct characters | 69 |
|---|
| Distinct categories | 10 ? |
|---|
| Distinct scripts | 2 ? |
|---|
| Distinct blocks | 4 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Unique | 401 ? |
|---|
| Unique (%) | 39.2% |
|---|
Sample
| 1st row | Is |
|---|
| 2nd row | Information System |
|---|
| 3rd row | Language and literacy education |
|---|
| 4th row | Law |
|---|
| 5th row | IS |
|---|
| Value | Count | Frequency (%) |
| information | 91 | 4.3% |
| of | 81 | 3.8% |
| and | 77 | 3.6% |
| engineering | 76 | 3.6% |
| department | 65 | 3.1% |
| architecture | 55 | 2.6% |
| system | 51 | 2.4% |
| administration | 43 | 2.0% |
| chemical | 43 | 2.0% |
| 41 | 1.9% |
| Other values (334) | 1508 | 70.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 1428 | 8.5% |
| a | 1304 | 7.8% |
| 1296 | 7.7% |
| i | 1279 | 7.6% |
| e | 1226 | 7.3% |
| t | 935 | 5.6% |
| o | 800 | 4.8% |
| r | 680 | 4.1% |
| s | 675 | 4.0% |
| c | 671 | 4.0% |
| Other values (59) | 6435 | 38.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 12227 | 73.1% |
| Uppercase Letter | 3067 | 18.3% |
| Space Separator | 1296 | 7.7% |
| Dash Punctuation | 37 | 0.2% |
| Decimal Number | 30 | 0.2% |
| Other Punctuation | 26 | 0.2% |
| Close Punctuation | 22 | 0.1% |
| Open Punctuation | 22 | 0.1% |
| Currency Symbol | 1 | < 0.1% |
| Other Symbol | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 1428 | 11.7% |
| a | 1304 | 10.7% |
| i | 1279 | 10.5% |
| e | 1226 | 10.0% |
| t | 935 | 7.6% |
| o | 800 | 6.5% |
| r | 680 | 5.6% |
| s | 675 | 5.5% |
| c | 671 | 5.5% |
| m | 549 | 4.5% |
| Other values (17) | 2680 | 21.9% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 402 | 13.1% |
| A | 375 | 12.2% |
| I | 323 | 10.5% |
| E | 309 | 10.1% |
| C | 175 | 5.7% |
| N | 164 | 5.3% |
| M | 163 | 5.3% |
| T | 163 | 5.3% |
| D | 145 | 4.7% |
| B | 126 | 4.1% |
| Other values (15) | 722 | 23.5% |
Other Punctuation
| Value | Count | Frequency (%) |
| & | 17 | 65.4% |
| , | 3 | 11.5% |
| . | 2 | 7.7% |
| ? | 2 | 7.7% |
| ' | 1 | 3.8% |
| # | 1 | 3.8% |
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 10 | 33.3% |
| 2 | 9 | 30.0% |
| 1 | 7 | 23.3% |
| 0 | 3 | 10.0% |
| 3 | 1 | 3.3% |
Space Separator
| Value | Count | Frequency (%) |
| 1296 | 100.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 37 | 100.0% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 22 | 100.0% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 22 | 100.0% |
Currency Symbol
| Value | Count | Frequency (%) |
| € | 1 | 100.0% |
Other Symbol
| Value | Count | Frequency (%) |
| â„¢ | 1 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 15294 | 91.4% |
| Common | 1435 | 8.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 1428 | 9.3% |
| a | 1304 | 8.5% |
| i | 1279 | 8.4% |
| e | 1226 | 8.0% |
| t | 935 | 6.1% |
| o | 800 | 5.2% |
| r | 680 | 4.4% |
| s | 675 | 4.4% |
| c | 671 | 4.4% |
| m | 549 | 3.6% |
| Other values (42) | 5747 | 37.6% |
Common
| Value | Count | Frequency (%) |
| 1296 | 90.3% |
| - | 37 | 2.6% |
| ) | 22 | 1.5% |
| ( | 22 | 1.5% |
| & | 17 | 1.2% |
| 4 | 10 | 0.7% |
| 2 | 9 | 0.6% |
| 1 | 7 | 0.5% |
| 0 | 3 | 0.2% |
| , | 3 | 0.2% |
| Other values (7) | 9 | 0.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 16726 | > 99.9% |
| None | 1 | < 0.1% |
| Currency Symbols | 1 | < 0.1% |
| Letterlike Symbols | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| n | 1428 | 8.5% |
| a | 1304 | 7.8% |
| 1296 | 7.7% |
| i | 1279 | 7.6% |
| e | 1226 | 7.3% |
| t | 935 | 5.6% |
| o | 800 | 4.8% |
| r | 680 | 4.1% |
| s | 675 | 4.0% |
| c | 671 | 4.0% |
| Other values (56) | 6432 | 38.5% |
None
| Value | Count | Frequency (%) |
| â | 1 | 100.0% |
Currency Symbols
| Value | Count | Frequency (%) |
| € | 1 | 100.0% |
Letterlike Symbols
| Value | Count | Frequency (%) |
| â„¢ | 1 | 100.0% |
| Distinct | 3 |
|---|
| Distinct (%) | 0.3% |
|---|
| Missing | 0 |
|---|
| Missing (%) | 0.0% |
|---|
| Memory size | 8.3 KiB |
|---|
Length
| Max length | 11 |
|---|
| Median length | 6 |
|---|
| Mean length | 7.633079848 |
|---|
| Min length | 6 |
|---|
Characters and Unicode
| Total characters | 8030 |
|---|
| Distinct characters | 17 |
|---|
| Distinct categories | 2 ? |
|---|
| Distinct scripts | 1 ? |
|---|
| Distinct blocks | 1 ? |
|---|
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Sample
| 1st row | Visual |
|---|
| 2nd row | Visual |
|---|
| 3rd row | Visual |
|---|
| 4th row | Visual |
|---|
| 5th row | Auditory |
|---|
| Value | Count | Frequency (%) |
| visual | 538 | 51.1% |
| auditory | 284 | 27.0% |
| kinesthetic | 230 | 21.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 1282 | 16.0% |
| u | 822 | 10.2% |
| s | 768 | 9.6% |
| t | 744 | 9.3% |
| l | 538 | 6.7% |
| V | 538 | 6.7% |
| a | 538 | 6.7% |
| e | 460 | 5.7% |
| A | 284 | 3.5% |
| d | 284 | 3.5% |
| Other values (7) | 1772 | 22.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 6978 | 86.9% |
| Uppercase Letter | 1052 | 13.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 1282 | 18.4% |
| u | 822 | 11.8% |
| s | 768 | 11.0% |
| t | 744 | 10.7% |
| l | 538 | 7.7% |
| a | 538 | 7.7% |
| e | 460 | 6.6% |
| d | 284 | 4.1% |
| o | 284 | 4.1% |
| r | 284 | 4.1% |
| Other values (4) | 974 | 14.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| V | 538 | 51.1% |
| A | 284 | 27.0% |
| K | 230 | 21.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8030 | 100.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 1282 | 16.0% |
| u | 822 | 10.2% |
| s | 768 | 9.6% |
| t | 744 | 9.3% |
| l | 538 | 6.7% |
| V | 538 | 6.7% |
| a | 538 | 6.7% |
| e | 460 | 5.7% |
| A | 284 | 3.5% |
| d | 284 | 3.5% |
| Other values (7) | 1772 | 22.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8030 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 1282 | 16.0% |
| u | 822 | 10.2% |
| s | 768 | 9.6% |
| t | 744 | 9.3% |
| l | 538 | 6.7% |
| V | 538 | 6.7% |
| a | 538 | 6.7% |
| e | 460 | 5.7% |
| A | 284 | 3.5% |
| d | 284 | 3.5% |
| Other values (7) | 1772 | 22.1% |